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02/06/2000 



PrkJilty uale (<Uky/monthty*wr) 
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Applicant 

THE GOVERNMENT UH THE UNITEP STATES OF AMERICA.., 



1 The applicant is hereby notified II tat this International Preliminary Examining Authority transmits herewith the 
" international preliminary examination roport and its annexes, it any. established on The International application, 

2. A copy of the report and Its annexes, if any, is being transmitted to the international Bureau for communication 
to all the elected Offices. 

a Where required by any of the elected Offices, the International Bureau will prepare an English translation of the 
rnport (but not of any annexes) and will transmit euch translation to thosa Offices. 

4. REMINDER 

The applicant must enter the national phase boforo each elected Office by f"*™^ 
translations an d paying national fees) within RO months from the priority data . (ui Mer m some Offices) (Article 
39(1)) (eee alco the reminder sant oy the International Bureau with Form PCTVIB/auij. 

Where a translation of the international application musl be furnished to an elected Office that translation must 
ySSn m ttSSSion of any annexes to the international preliminary examination report. It IS the applicants 
responsibility to prepare and furnish euch translation dlmrtly to eacn elected Office concerned. 

For further details on the applicable timo limits and requirements of the elected Offices, see Volume II of tho 
PCT Applicant's Guide. 
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Applicant's or agent's file reference 
2026-4302PC 


criD ci |n TU r D See Notification of Transmittal of International 

FOR FURTHER ACTION Preliminary Examination Report (Form PCT/IPEA/416) 


International application No. 
PCT/US00/15446 


International filing date (day/month/year) 
02/06/2000 


Priority date (day/month/year) 
04/06/1 999 


International Patent Classification (IPC) or national classification and IPC 
C12N15/51 


Applicant 

THE GOVERNMENT OF THE UNITED STATES OF AMERICA... 



1. This international preliminary examination report has been prepared by this International Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 

2. This REPORT consists of a total of 8 sheets, including this cover sheet. 

□ This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 



3. This report contains indications relating to the following items: 



I 




II 


□ 


III 


n 


IV 


s 


V 




VI 


□ 


VII 


H 


VIII 





citations and explanations suporting such statement 



Date of submission of the demand 
02/01/2001 


Date of completion of this report 
19.10.2001 


Name and mailing address of the international 
preliminary examining authority: 

European Patent Office - P.B. 581 8 Patentlaan 2 
flS) NL-2280 HV Rijswijk - Pays Bas 
C^' Tel. +31 70 340 - 2040 Tx: 31 651 epo nl 
Fax: +31 70 340 - 3016 


Authorized officer ^-r^-^ 

Montero Lopez, B (f ^) j) 

Telephone No. +31 70 340 3739 V^^3-^ 
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INTERNATIONAL PRELIMINARY 
EXAMINATION REPORT 



International application No, PCT/US00/1 5446 



I. Basis of the report 

1 . With regard to the elements of the international application (Replacement sheets which have been furnished to 
the receiving Office in response to an invitation under Article 14 are referred to in this report as "originally filed" 
and are not annexed to this report since they do not contain amendments (Rules 70, 16 and 70. 17)): 
Description, pages: 

1 -54 as originally filed 

Claims, No.: 

1-37 as originally filed 

Drawings, sheets: 

1/21-21/21 as originally filed 

Sequence listing part of the description, pages: 

1-84, as originally filed 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

H contained in the international application in written form. 

IS filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 
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□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

5. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 

6. Additional observations, if necessary: 



IV. Lack of unity of invention 

1 . In response to the invitation to restrict or pay additional fees the applicant has: 

□ restricted the claims. 

□ paid additional fees. 

□ paid additional fees under protest. 

EI neither restricted nor paid additional fees. 

2. □ This Authority found that the requirement of unity of invention is not complied and chose, according to Rule 

68.1 , not to invite the applicant to restrict or pay additional fees. 

3. This Authority considers that the requirement of unity of invention in accordance with Rules 13.1, 13.2 and 13.3 is 

□ complied with. 

□ not complied with for the following reasons: 

4. Consequently, the following parts of the international application were the subject of international preliminary 
examination in establishing this report: ""wry 

□ all parts. 

IS the parts relating to claims Nos. 1-1 1, 33, 34, 37 completely, 12-20, 23, 24, 29-32, 35, 36 partially. 

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability- 
citations and explanations supporting such statement 

1. Statement 

Novelty (N) Yes: Claims 4-15,23,24,33-37 
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No: 



Claims 1-3, 16-20, 29-32 



Inventive step (IS) 



Yes: 
No: 



Claims 

Claims 1 -20, 23, 24, 29-37 



Industrial applicability (IA) Yes: Claims 1-20, 23, 24, 29-37 

No: Claims 



2. Citations and explanations 
see separate sheet 



VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 



VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 
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Re Item V 

Reasoned statement under Article 35(2) with regard to novelty, inventive step or 
industrial applicability; citations and explanations supporting such statement 

Reference is made to the following documents: 

D1 : EP-A-0 532 167 {Immuno Japan Inc.} 

D2 : Journal of General Virology, 1991, vol. 72, pages 2697-2704, 

H. Okamoto et al. 
D3 : Nucleic Acids Research, vol. 20, no. 13, April 1991, page 3520, 
J.H. Han et al. 

D4: Biochemistry, 1998, vol. 37, no. 10, pages 3392-3401, D. L. Sali et al. 
D5: Antiviral Research, 1999, vol. 42, pages 59-70, J. Martin et al. 

1. The underlying application relates to a nucleic acid molecule which encodes human 
hepatitis C virus of genotype 2a, DNA constructs comprising said nucleic acid, RNA 
transcript of said construct, cell transfected with said transcript, hepatitis C virus 
polypeptide produced by said cell and whose genome comprises said nucleic acid, 
method for assaying candidate antiviral agents against for activity against HCV using 
said cell containing HCV, antibody to said polypeptide or to said HCV, method for 
determining the susceptibility of cells in vitro to support HCV infection using the cells 
transfected with the nucleic acid of claim 1 and compositions comprising said 
polypeptide suspended in a pharmaceutically acceptable diluent or excipient. 

1.1. D1 discloses a non-A, non-B hepatitis virus RNA, from a strain of NANB 
called HC-J6 where the genome shares a 96% identity in 9588 base pairs with 
SEQ ID NO: 1 and a 97.8% identity in 3033 amino acids with SEQ ID NO: 2 of the 
hepatitis C virus genotype 2a strain HC-J6 CH of the present application. D1 also 
discloses antibodies to the polypeptides of the NANB hepatitis virus. 

1.2. D2 discloses the cDNA sequence of the hepatitis C virus isolate HC-J6, also 
where the genome shares a 96% identity in 9588 base pairs with SEQ ID NO: 1 of 
the present application. 

1 .3. D3 discloses the 3' end of the HCV genome which shares a 96% identity in 
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9588 base pairs with SEQ ID NO: 1 of the present application. 

1 .4. Consequently, the present application does not satisfy the criterion set forth in 
Article 33(2) PCT because the subject-matter of claims 1 to 3 , 16 to 20 and 29 to 
32 is not new in respect of prior art as defined in the regulations (Rule 64(1 W3) 
PCT). 

2. The subject-matter of claims 4-1 5, 23, 24, and 33-37 has not been disclosed in the 
state of the art and therefore claims 4-15, 23, 24 and 33-37 are novel according to 
Article 33(2) PCT. 

2.1. Documents D1 and D2 are considered to represent the most relevant state of 
the art and disclose the amino and nucleic acid sequences of the HC-J6 strain of 
hepatitis virus. Due to the fact that the genome sequence of the HC-J6 strain 
disclosed in D1 and D2 is so closely homologous to the genome sequence of the 
hepatitis C virus genotype 2a strain HC-J6 CH of present application, the person 
skilled in the art would consider it a matter of routine to produce DNA constructs 
comprising the nucleic acid of the HC-J6 strain, RNA transcripts comprising said 
DNA construct or cells transfected with said DNA construct or RNA transcript The 
subject-matter of claims 4-15 therefore does not appear to involve an inventive 
step according to Article 33{3} PCT. 

2.2. Claims 23 and 24 involve a method for assaying candidate antiviral agents 
for activity against HCV comprising exposing a cell containing the HCV of claims 
16 or 17 to the candidate antiviral agent. D4 describes the expression of HCV full 
length NS3 and NS4A in insect cells. The NS3/4A complex was purified and the 
dependence of the NS3/4A protease activity on buffer conditions, temperature and 
the presence of detergents was examined. The NS3/4A complex was found to be 
an attractive target for antiviral therapy against HCV. D5 discloses the effects of 
amantadine and interferon oc-2a on hepatitis C virus markers in cultured peripheral 
blood mononuclear cells (PBMC). 27% of the patients showed HCV core and NS3 
specific proliferative responses. D4 and D5 illustrate the classic methods of 
assaying candidate antiviral agents for activity against HCV. Consequently the 
person skilled in the art would consider it a matter of routine to use said methods 
adapted to the HCV cell from the HCV genotype 2a in a method to assay for 
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candidate antiviral agents for activity against HCV. The subject-matter of claims 
23 and 24 does therefore not satisfy the criterion set forth in Article 33(3) PCT as 
the subject-matter of said claims does not involve an inventive step (Rule 65(1 )(2) 
PCT). 

2.3. Claims 33 and 34 involve a method for determining the susceptibility of cells 
in vitro to support HCV infection. In a similar manner to the testing of an antiviral 
agent, the person skilled in the art would consider it a matter of routine to 
determine whether cells in vitro are able to support HCV infection. The subject- 
matter of claims 33 and 34 do not appear to add any inventive features to the 
claims on which they depend. Claims 33 and 34 are therefore considered not to 
involve an inventive step according to Article 33(3) PCT. 

2.4. Furthermore the simple composition of a polypeptide originating from an HCV 
genotype 2a and a pharmaceutical^ acceptable carrier, is also considered to be a 
matter of routine procedure, obvious to the person skilled in the art and therefore 
not involving any inventive skill according to Article 33(3) PCT. Claims 35 to 37 
are therefore also not inventive according to Article 33(3) PCT. 



Re Item VII 

Certain defects in the International application 

1 . Contrary to the requirements of Rule 5. 1 (a)(ii) PCT, many of the relevant prior art 
documents are not mentioned in the description. 



Re Item VIII 

Certain observations on the international application 

1. An independent claim must specify clearly all the essential technical features 
necessary to define the invention. In the present case the nucleic acid and amino acid 
sequences of the HCV genotype 2a provided by SEQ. ID. NOs: 1 and 2 respectively 
are considered to be essential technical features which allow the unambiguous 
characterization of the products concerned. Accordingly the feature of the SEQ. ID. 
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NOs: 1 and 2 should be present into the independent Claim 1 in order to satisfy the 
requirements of clarity according to Article 6 and Rule 6(3)(b) PCT. 

2. Claims 19 and 20 involve "a polypeptide encoded by the nucleic acid sequence 
according to...", however claim 21 indicates that only the specific NS3 protease, E1 
protein, E2 protein or NS4 proteins are claimed. Due to the fact that nucleic acid 
sequences of claims 1 and 3 encode different HCV polypeptides, claims 19 and 20 
encompass HCV polypeptides other than those in claim 21 and therefore the subject- 
matter of claims 19 and 20 is vague and unclear and open to interpretation. 

3. A product is not rendered novel merely by the fact that it is produced by means of a 
new process. Furthermore claims for products defined in terms of a process are 
admissible only if the products as such fulfil the requirements of novelty and inventive 
step according to Articles 33(2) and (3) PCT. Thus claims 12-15 defining a product in 
terms of a process are construed as claims to the product per se . 
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THE INTERNATIONAL SEARCH REPORT 
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fPCT Rule 44.1) 
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International application Nc. 
PCT/ US 00/ 15446 



Date of mailing 
( day/mon thy ear) 



23/02/2001 



FOR FURTHER ACTION See paragraphs 1 and 4 below 



Internationa! filing date 
(day'mcntVyear) 02/0 6/2000 



Applicant 

THE GOVERNMENT OF THE UNITED STATES OF AMERICA. . 



l . [X\ The apphcant is hereby notified that the International Search Report has been established and is transmitted herewith. 
Filing of amendments and statement under Article 19: 

The applicant is entitled, if he so wishes, to amend the claims of the International Application (see Rule 46): 

When? The time limit for filing such amendments is normally 2 months from the date of transmittal of the 
Internationa Search Report: however, lor more details, see the notes on the accompanying sheet. 

Where? Directly to the International Bureau of WIPO CASE, 

34, chemin des Coiombettes . O n ' y ( /\ t f Z 

1211 Geneva 20. Switzerland DUE ClMlA <7>*,'t60l \f tf> H 

Fascimiie No.: (41-22) 740.14.35 " j ^ - f 

1 mo, cafl-im /fe^7 



For more detailed instructions, see the notes on the accompanying sheet. 



The applicant is hereby notified that no International Search Report will be established and that the JeXa- 
Article 1 7(2)(a) to that effect is transmitted herewith. 



3. Q With regard to the protest against payment of (an) additional fee{s) under Rule 40.2. the applicant is notified that: 

r~j the protest together with the decision thereon has been transmitted to the International Bureau toqether with the 
aophcant s request to forward the texts of both the protest and the decision thereon to the designated Offices. 

I I no deci sion has been made yet on the protest; the applicant will be notified as soon as a decision is made. 

4. Further action(s): The applicant is reminded of the following: 

Shortly after 18 months from the priority date, the international application will be published by the International Bureau 
If the applicant wishes to avoid or postpone publication, a notice of withdrawal of the international application or of the 
priority claim must reach the International Bureau as provided in Rules 90o/s.1 and 90o/s.3, respectively before the 
completion of the technical preparations for international publication. 

Within 19 months from the priority date, a demand for international preliminary examination must be filed if the applicant 
wishes to postpone the entry into the national phase until 30 months from the priority date (in some Offices even later). 

Within 20 months from the priority date, the applicant must perform the prescribed acts for entry into the national phase 
before all designated Offices which have not been elected in the demand or in a later election within 1 9 months from the 
priority date or could not be elected because they are not bound by Chapter II. 




Name and mailing address of the International Searching Authority 
~~ European Patent Office, P.B. 5818 Patentlaan 2 
NL-2280 HV Rtjswijk 

Tel. (+31-70) 340-2040, Tx. 31 651 epo nl 
Fax; (+31-70) 340-3016 



Authorized officer 

Catherine Humbert 
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INSTRUCTIONS CONCERNING AMENDMENTS UNDER ARTICLE 19 

in^tff.^ .0 amend the c, a , ms o, the 

description and drawings) may be amended durmn thl in;Vn^L^ P international application (claims, 

no need to file amendments of the claTm und Art cle 9 S Pr0Ce , dure - there l£ ^ 

for the purposes of provisional protection or ha<^ annthP ,^crf« !' 9 ' u a PP |lCant wants the latter to be published 

Furthermore, i, should be e^^^^o^Z ^^^^e^ 0 ™ 1 ""'^ 

What parts of the international application may be amended? 

Under Article 19. only the claims may be amenaec 

Article 34 before the International Examin.np Authority. drawings may only be amended under 

oXlr^Xbl' Ahide ^ aS6 ' a " Pan? °* ' he ,n ' e ™"<™' -PP'-ation may be amended under Article 26 



When? 



How? 



X whi^ S6arCh ' ep ° n ° r 15 ™ nths *™ «he Priority 

as having been received on i™ i Mhey are Reeled Zit^r ^Vf* amendm ents will be considered 
applicable,^ l,m„ but betore the cc^on^hl .ffiSttK"" 

Where not to file the amendments? 

i^,^ B — -d no, w„h the receding Oftice or the 

Where a demand for international preliminary examination has been/is Wed. see below. 

1^^™^^^°'*'' da ' mS - * ^ «»» or more new Cairns or by amending the text of 
tZ^ZT^o^l ^^^^ °' ^ ^ M °" °< or 

be renumbered consecutively (Adm.n*,^ ClaimS are -numbered, they must 

The amendments must be made in the language in which the international application is to be published. 

What documents must/may accompany the amendments? 
Letter (Section 205(b)): 

The amendments must be submitted with a letter. 
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Applicant's or agent's file reference 

2026-4302PC 


FOR FURTHER see Notification of Transmittal of International Search Report 
ACTION CF ° rm PCT/ISA/22 °) as wel1 as - where applicable, item 5 below. 


International application No. 

PCT/ US 00/ 15446 


International filing date (day/month/year) 

02/06/2000 


(Earliest) Priority Date (day/month/year) 

04/06/1999 


Applicant — ■ 

THE GOVERNMENT OF THE UNITED STATES OF AMERICA... 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 18. A copy is being transmitted to the International Bureau. dppncam 



This International Search Report consists of a total of _ 



11 



sheets. 



[X] It is also accompanied by a copy of each prior art document cited in this report. 



1 . Basis of the report 

a. With regard to the language the international search was carried out on the basis of the international application in the 
language in which it was filed, unless otherwise indicated under this item. ^ 

^ Authority Ste^b)) ^ ^ ^ ^ ° f * translation of the international application furnished to this 

b. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international search 
was carried out on the basis of the sequence listing : HK l u dl bearcn 

[X] contained in the international application in written form. 

[X| filed together with the international application in computer readable form. 

I | furnished subsequently to this Authority in written form. 

I | furnished subsequently to this Authority in computer readble form. 

□ the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 

^ furnfshed ment ^ ^ information recorded in computer readable form is identical to the written sequence listing has been 

2 Q Certain claims were found unsearchable (See Box I). 
3- [X] Unity of invention is lacking (see Box II). 

4. With regard to the title, 

[X] the text is approved as submitted by the applicant. 

[~l tne text has been established by this Authority to read as follows: 



5. With regard to the abstract, 

[X] the text is approved as submitted by the applicant. 

□ th l text has been established, according to Rule 38.2(b), by this Authority as it appears in Box III. The applicant may 
within one month from the date of mailing of this international search report, submit comments to this Authority. 

6. The figure of the drawings to be published with the abstract is Figure No. -| 

ffl as suggested by the applicant. Q None 0 f the figures. 

Q because the applicant failed to suggest a figure. 
Q because this figure better characterizes the invention. 
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A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 C12N15/51 C07K14/18 C07K16/18 A61K38/00 A61K39/00 



According to International Patent Classification (IPO) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C07K C12N A61K 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name ot data base and, where practical, search terms used) 

EPO-Internal , EMBASE, MEDLINE , STRAND, CAB Data, WPI Data, BIOSIS 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



EP 0 532 167 A (JAPAN IMMUNO INC) 
17 March 1993 (1993-03-17) 
the whole document 

H. 0KAM0T0 ET AL. : "Nucleotide sequence 

of the genomic RNA of hepatitis C virus 

isolated from a human carrier: comparison 

with reported isolates for conserved and 

divergent regions." 

JOURNAL OF GENERAL VIROLOGY , 

vol. 72, 1991, pages 2697-2704, 

XP000911895 

the whole document 

-/-- 



1-37 



1-37 



rther documents are listed in the continuation of box C. 



El 



Patent family members are listed in annex. 



° Special categories of cited documents : 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-11, 33, 34, 

37 completely and partially claims 12-20, 23, 24, 
29-32, 35, 36 and 37 



A purified and isolated nucleic acid molecule which encodes 
human hepatitis C virus of genotype 2a, DNA constructs 
comprising said nucleic acid, RNA transcript of said 
construct, cell transfected with said transcript, hepatitis 
C virus polypeptide produced by said cell and whose genome 
comprises said nucleic acid, method for assaying candidate 
antiviral agents against for activity against HCV using said 
cell containing HCV, antibody to said polypetide or to said 
HCV, method for determining the susceptibility of cells in 
vitro to support HCV infection using the cells transfected 
with the nucleic acid of claim 1 and compositions comprising 
said polypeptide suspended in a pharmaceuti cal ly acceptable 
diluent or excipient. 



2. Claims: 25 and 26 completely and 12-23, 24, 29-32, 35, 
36 and 37 parti al ly 



A hepatitis C virus polypeptide produced by a cell 
transfected with a DNA construct comprising a nucleic acid 
molecule which encodes human hepatitis C virus of genotype 
2a which is an NS3 protease and method for assaying 
candidate antiviral agents against for activity against HCV 
comprising exposing said HCV protease to candidate antiviral 
agents, antibody to said polypeptide or HCV and compositions 
comprising said polypeptide suspended in a pharmaceutically 
acceptable diluent or excipient. 



3. Claims: 12-23, 24, 29-32, 35, 36 and 37 partially 

A hepatitis C virus polypeptide produced by a cell 
transfected with a DNA construct comprising a nucleic acid 
molecule which encodes human hepatitis C virus of genotype 
2a which is an El protein, antibody to said polypeptide or 
HCV and compositions comprising said polypeptide suspended 
in a pharmaceutically acceptable diluent or excipient.. 



4. Claims: 12-23, 24, 29-32, 35, 36 and 37 partially 

A hepatitis C virus polypeptide produced by a cell 
transfected with a DNA construct comprising a nucleic acid 
molecule which encodes human hepatitis C virus of genotype 
2a which is an E2 protein, antibody to said polypeptide or 
HCV and compositions comprising said polypeptide suspended 
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in a pharmaceutical^ acceptable diluent or excipient. 



5. Claims: 12-23, 24, 29-32, 35, 36 and 37 partially 

A hepatitis C virus polypeptide produced by a cell 
transfected with a DNA construct comprising a nucleic acid 
molecule which encodes human hepatitis C virus of genotype 
2a which is an NS4 protein, antibody to said polypeptide or 
HCV and compositions comprising said polypeptide suspended 
in a pharmaceuti cal ly acceptable diluent or excipient.. 



6. Claim : 27 and 28 completely 

Antiviral agent identified as having antiviral activity for 
HCV by the method of claims 23 and/or 25. 
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WRITTEN OPINION International application No. PCT/US00/1 5446 

I. Basis of the opinion 

1. With regard to the elements of the international application (Replacement sheets which have been furnished to 
the receiving Office in response to an invitation under Article 14 are referred to in this opinion as "originally filed"): 

Description, pages: 

1 -54 as originally filed 

Claims, No.: 

1-37 as originally filed 

Drawings, sheets: 

1/21-21/21 as originally filed 



2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1(b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 
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□ 



the drawings, 



sheets: 



5. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary: 

IV. Lack of unity of invention 

1. In response to the invitation (Form PCT/IPEA/405) to restrict or pay additional fees, the applicant has: 

□ restricted the claims. 

□ paid additional fees. 

□ paid additional fees under protest, 
neither restricted nor paid additional fees. 

2. □ This Authority found that the requirement of unity of invention is not complied with for the following reasons 

and chose, according to Rule 68.1 , not to invite the applicant to restrict or pay additional fees: 

3. Consequently, the following parts of the international application were the subject of international preliminary 
examination in establishing this opinion: 

□ all parts. 

the parts relating to claims Nos. 1-11, 33, 34, 37 completely, 12-20, 23, 24, 29-32, 35, 36 partially. 

V. Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement ^ 
Novelty (N) Claims 1 -3, 1 6-20, 29-32 

Inventive step (IS) Claims 4-15, 23, 24, 33-37 

Industrial applicability (IA) Claims 

2. Citations and explanations 
see separate sheet 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
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see separate sheet 



VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 
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IV. Lack of unity (Continuation) 

The IPEA agrees with the ISA that the different subject-matters of the present 
application are not so linked as to form a single general inventive concept (Rule 
13.1 PCT) for the following reasons; 

The prior art Dl: Journal of General Virology, 1991 , vol. 72, pages 2697-2704, 

H. Okamoto et al. discloses the cDNA sequence of the hepatitis C virus isolate 
HC-J6. 

Dll: Journal of Clinical Microbiology, 1997, vol. 35, no. 1, pages 201-207, T. 
Ohno et al. describes the genotyping of HCV based on PCR of the core region 
with genotype specific primers for the determination of HCV genotypes 1a, 1b, 
2a, 2b, 3a, 3b, 4, 5a and 6a. 

DIM: International Heptatolgy Communications, 1996, vol. 4, pages 263-267, M. 
Hashimoto et al. discloses a genotyping system using primers from the HS5 
region allows determination of the HCV genotypes 1a, 1b, 2a f 2b and 3b 

In view of the state of the art the problem may therefore be defined as the 
provision of the complete nucleic acid sequence which comprises the genome 
of infectious HCV genotype 2a. 

The present application provides the solutions of ; 

I. }the complete nucleic acid sequence which comprises the genome of 
infectious HCV genotype 2a. 

2. } a hepatitis C virus polypeptide produced by a cell transfected with a DNA 
construct comprising a nucleic acidjnolecule which encodes human hepatitis C 
virus of genotype 2a which is; 

2.1} an NS3 protease, 
2.2} an E1 protein, 
2.3} an E2 protein, 
2.4} an NS4 protein, and 

3. } Antiviral agent identified as having antiviral activity for HCV by the method of 
claims 23 and/or 25. 
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Consequently due to the fact that the isolate comprising the HCV 2a genotype, 
Dl is known in the state of the art, and that genotyping methods to different 
coding regions of the HCV genome have been used in the state of the art to 
identify HCV 2a gentypes due to the fact that the different solutions essentially 
different in terms of technical structure and function and due to the absence of 
further technical features which could provide a common novel and inventive 
linking concept, the IPEA is of the opinion that there is no single inventive 
concept underlying the set of claimed inventions of the present application 
according to Rule 13.1 PCT. 

There is therefore lack of unity and the different inventions, not belonging to a 
common inventive concept are formulated as the following different subjects 
according to Article 17{3}{a} PCT; 

Invention 1 : Claims 1 -1 1 , 33, 34, 37 completely and partially claims 1 2-20, 23, 
24, 29-32, 35, 36 and 37: A purified and isolated nucleic acid molecule which 
encodes human hepatitis C virus of genotype 2a, DNA constructs comprising 
said nucleic acid, RNA transcript of said construct, cell transfected with said 
transcript, hepatitis C virus polypeptide produced by said cell and whose 
genome comprises said nucleic acid, method for assaying candidate antiviral 
agents against for activity against HCV using said cell containing HCV, antibody 
to said polypetide or to said HCV, method for determining the susceptibility of 
cells in vitro to support HCV infection using the cells transfected with the nucleic 
acid of claim 1 and compositions comprising said polypeptide suspended in a 
pharmaceutical^ acceptable diluent or excipient. 

Invention 2 : Claims 25 and 26 completely and 12-23, 24, 29-32, 35, 36 and 37 
partially: A hepatitis C virus polypeptide produced by a cell transfected with a 
DNA construct comprising a nucleic acid molecule which encodes human 
hepatitis C virus of genotype 2a which is an NS3 protease and method for 
assaying candidate antiviral agents against for activity against HCV comprising 
exposing said HCV protease to candidate antiviral agents, antibody to said 
polypeptide or HCV and compositions comprising said polypeptide suspended 
in a pharmaceutical^ acceptable diluent or excipient. 
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Invention 3 : Claims 12-23, 24, 29-32, 35, 36 and 37 partially: A hepatitis C virus 
polypeptide produced by a cell transfected with a DNA construct comprising a 
nucleic acid molecule which encodes human hepatitis C virus of genotype 2a 
which is an 

E1 protein, antibody to said polypeptide or HCV and compositions comprising 
said polypeptide suspended in a pharmaceutical^ acceptable diluent or 
excipient. 

Invention 4 : Claims 12-23, 24, 29-32, 35, 36 and 37 partially: A hepatitis C virus 
polypeptide produced by a cell transfected with a DNA construct comprising a 
nucleic acid molecule which encodes human hepatitis C virus of genotype 2a 
which is an 

E2 protein, antibody to said polypeptide or HCV and compositions comprising 
said polypeptide suspended in a pharmaceutical^ acceptable diluent or 
excipient. 

Invention 5 : Claims 12-23, 24, 29-32, 35, 36 and 37 partially: A hepatitis C virus 
polypeptide produced by a cell transfected with a DNA construct comprising a 
nucleic acid molecule which encodes human hepatitis C virus of genotype 2a 
which is an 

NS4 protein, antibody to said polypeptide or HCV and compositions comprising 
said polypeptide suspended in a pharmaceutical^ acceptable diluent or 
excipient. 

Invention 6 : Claims 27 and 28 completely: Antiviral agent identified as having 
antiviral activity for HCV by the method of claims 23 and/or 25. 



Form PCT/Separate Sheet/408 (Sheet 3) (EPO-April 1997) 



WRITTEN OPINION International application No. PCT/US00/1 5446 

SEPARATE SHEET 



V. Reasoned statement (Continuation) 

Invention 1: Claims 1-11, 33, 34, 37 completely and partially claims 12-20, 
23, 24, 29-32, 35, 36 and 37: A purified and isolated nucleic acid molecule 
which encodes human hepatitis C virus of genotype 2a, DNA constructs 
comprising said nucleic acid, RNA transcript of said construct, cell 
transfected with said transcript, hepatitis C virus polypeptide produced by 
said cell and whose genome comprises said nucleic acid, method for 
assaying candidate antiviral agents against for activity against HCV using 
said cell containing HCV, antibody to said polypeptide or to said HCV, 
method for determining the susceptibility of cells in vitro to support HCV 
infection using the cells transfected with the nucleic acid of claim 1 and 
compositions comprising said polypeptide suspended in a 
pharmaceutical^ acceptable diluent or excipient. 

a. CITATIONS 

Reference is made to the following documents: 
D1 : EP-A-0 532 167 {Immuno Japan Inc.} 

D2 : Journal of General Virology, 1991, vol. 72, pages 2697-2704, 

H. Okamoto et al. 
D3 : Nucleic Acids Research, vol. 20, no. 13, April 1991, page 3520, 
J.H. Han et al. 

D4: Biochemistry, 1998, vol. 37, no. 10, pages 3392-3401, D. L. Sali et al. 
D5: Antiviral Research, 1999, vol. 42, pages 59-70, J. Martin et al. 

b. NOVELTY (Art. 33(2) PCT) 

i. D1 discloses a non-A, non-B hepatitis virus RNA, from a strain 
of NANB called HC-J6 where the genome shares a 96% identity 
in 9588 base pairs with SEQ ID NO: 1 and a 97.8% identity in 
3033 amino acids with SEQ ID NO: 2 of the hepatitis C virus 
genotype 2a strain HC-J6 CH present application. D1 also 
discloses antibodies to the polypeptides of the NANB hepatitis 
virus. 
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ii. D2 discloses the cDNA sequence of the hepatitis C virus isolate 
HC-J6, also where the genome shares a 96% identity in 9588 
base pairs with SEQ ID NO: 1 of the present application. 

iii. D3 discloses the 3' end of the HCV genome which shares a 
96% identity in 9588 base pairs with SEQ ID NO: 1 of the 
present application. 

iv. The present application does not satisfy the criterion set forth in 
Article 33(2) PCT because the subject-matter of claims 1 to 3 , 
1 6 to 20 and 29 to 32 is not new in respect of prior art as 
defined in the regulations (Rule 64(1 )-(3) PCT). 

c. INVENTIVE STEP (Art. 33(3) PCT) 

i. Documents D1 and D2 are considered to represent the most 
relevant state of the art and discloses the amino and nucleic 
acid sequences of the HC-J6 strain of hepatitis virus. 

ii. Due to the fact that the genome sequence of the HC-J6 strain 
disclosed in D1 and D2 is so closely homologous to the genome 
sequence of the hepatitis C virus genotype 2a strain HC-J6 CH 
present application, the person skilled in the art would consider 
it a matter of routine to produce DNA constructs comprising the 
nucleic acid of the HC-J6 strain, RNA transcripts comprising 
said DNA construct or cells transfected with said DNA construct 
or RNA transcript. 

iii. The subject-matter of claims 4 to 15 therefore do not appear 
to involve an inventive step according to Article 33{3} PCT. 

iv. Claims 23 and 24 involve a method for assaying candidate 
antiviral agents for activity against HCV comprising exposing a 
cell containing the HCV of claims 16 or 17 to the candidate 
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antiviral agent. 

v. D4 describes the expression of HCV full length NS3 and NS4A 
in insect cells. The NS3/4A complex was purified and the 
dependence of the NS3/4A protease activity on buffer 
conditions, temperature and the presence of detergents was 
examined. The NS3/4A complex was found to be an attractive 
target for antiviral therapy against HCV. 

vi. D5 discloses the effects of amantadine and interferon oc-2a on 
hepatitis C virus markers in cultured peripheral blood 
mononuclear cells (PBMC). 27% of the patients showed HCV 
core and NS3 specific proliferative responses. 

vii. D4 and D5 illustrate the classic methods of assaying candidate 
antiviral agents for activity against HCV. Consequently the 
person skilled in the art would consider it a matter of routine to 
use said methods adapted to the HCV cell from the HCV 
genotype 2a in a method to assay for candidate antiviral agents 
for activity against HCV. 

viii. The subject-matter of claims 23 and 24 does therefore not 
satisfy the criterion set forth in Article 33(3) PCT as the 
subject-matter of said claims does not involve an inventive step 
(Rule 65(1 )(2) PCT). 

ix. Claims 33 and 34 involve a method for determining the 
susceptibility of cells in vitro to support HCV infection. In a 
similar manner to the testing of an antiviral agent, the person 
skilled in the art would consider it a matter of routine to 
determine whether cells in vitro are able to support HCV 
infection. The subject-matter of claims 33 and 34 do not appear 
to add any inventive features to the claims on which they 
depend. 
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x. Claims 33 and 34 are therefore considered not to involve an 
inventive step according to Article 33{3} PCT. 

xi. Furthermore the simple composition of a polypeptide originating 
from an HCV genotype 2a and a pharmaceutical^ acceptable 
carrier, is also considered to be a matter of routine procedure, 
obvious to the person skilled in the art and therefore not 
involving any inventive skill according to Article 33{3} PCT. 
Claims 35 to 37 are therefore also not inventive according to 
Article 33{3} PCT. 

VII. Certain defects (Continuation) 

1 . Contrary to the requirements of Rule 5.1 (a)(ii) PCT, many of the relevant 
prior art documents are not mentioned in the description. 

2. If amendments are filed, it should be by way of replacement pages in the 
manner stipulated by Rule 66.8(a) PCT. In particular, fair copies of the 
amendments should be filed preferably in triplicate. Moreover, the 
applicant's attention is drawn to the fact that, as a consequence of Rule 
66.8(a) PCT the examiner is not permitted to carry out any amendments 
under the PCT procedure, however minor these may be. 

3. In order to facilitate the examination of the conformity of the amended 
application with the requirements of Article 34(2)(b) PCT, the applicant 
is requested to clearly identify the amendments carried out, no matter 
whether they concern amendments by addition, replacement or deletion, 
and to indicate the passages of the application as filed on which these 
amendments are based (see also Rule 66.8(a) PCT). If the applicant 
regards it as appropriate these indications could be submitted in 
handwritten form on a copy of the relevant parts of the application as 
filed. 
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4. The applicant is requested to note that in accordance with Rule 66.4 (a) 
PCT the issuance of an additional Written Opinion (WO) is facultative. 
Moreover, as the final action in the PCT procedure is an International 
Preliminary Examination Report (IPER) and not a decision, a violation 
of the right to be heard cannot exist. The applicant can not therefore rely 
on obtaining a second WO before the IPER is issued and is requested to 
answer this first WO in a complete manner. 

VIII. Certain Observations (Continuation) 

1 The application does not meet the requirements of Article 6 PCT 
because claims are not clear for the following reasons: 

1.1 An independent claim must specify clearly all the essential technical 
features necessary to define the invention. In the present case the 
nucleic acid and amino acid sequences of the HCV genotype 2a provided 
by SEQ. ID. NOs: 1 and 2 respectively are considered to be essential 
technical features which allow the unambiguous characterization of the 
products concerned. Accordingly the feature of the SEQ. ID. NOs: 1 and 
2 must be incorporated into the independent Claim 1 in order to satisfy 
the requirements of clarity according to Article 6 and Rule 6{3}{b} PCT. 

1.2 Claims 19 and 20 involve "a polypeptide encoded by the nucleic acid 
sequence according to...", however claim 21 indicates that only the 
specific NS3 protease, E1 protein, E2 protein or NS4 proteins are 
claimed. Due to the fact that nucleic acid sequences of claims 1 and 3 
encode different HCV polypeptides, claims 19 and 20 encompass HCV 
polypeptides other than those in claim 21 and therefore the subject- 
matter of claims 19 and 20 is vague and unclear and open to 
interpretation. 
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A product is not rendered novel merely by the fact that it is produced by 
means of a new process. Furthermore claims for products defined in 
terms of a process are admissible only if the products as such fulfil the 
requirements of novelty and inventive step according to Articles 33{2} 
and {3} PCT. Thus claims 12 to 15 defining a product in terms of a 
process are construed as claims to the product per se. 
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ATTENTION 

The applicant should carefully check the data appearing in this Notification. In case of any discrepancy between these data 
and the indications in the international application, the applicant should immediately inform the International Bureau. 
In addition, the applicant's attention is drawn to the information contained in the Annex, relating to: 

| X | time limits for entry into the national phase 

| | confirmation of precautionary designations 

| X | requirements regarding priority documents 

A copy of this Notification is being sent to the receiving Office and to the International Searching Authority. 
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INFORMATION ON TIME LIMITS FOR ENTERING THE NATIONAL PHASE 

The applicant is reminded that the "national phase" must be entered before each of the designated Offices indicated in the 
Notification of Receipt of Record Copy (Form PCT/IB/301 ) by paying national fees and furnishing translations, as prescribed by 
the applicable national laws. 

The time limit for performing these procedural acts is 20 MONTHS from the priority date or, for those designated States 
which the applicant elects in a demand for international preliminary examination or in a later election, 30 MONTHS from the 
priority date, provided that the election is made before the expiration of 19 months from the priority date. Some designated (or 
elected) Offices have fixed time limits which expire even later than 20 or 30 months from the priority date. In other Offices an 
extension of time or grace period, in some cases upon payment of an additional fee, is available. 

In addition to these procedural acts, the applicant may also have to comply with other special requirements applicable in 
certain Offices, rt is the applicant's responsibility to ensure that the necessary steps to enter the national phase are taken in a 
timely fashion. Most designated Offices do not issue reminders to applicants in connection with the entry into the national 
phase. 

For detailed information about the procedural acts to be performed to enter the national phase before each designated 
Office, the applicable time limits and possible extensions of time or grace periods, and any other requirements, see the relevant 
Chapters of Volume II of the PCT Applicant's Guide. Information about the requirements for filing a demand for international 
preliminary examination is set out in Chapter IX of Volume I of the PCT Applicant's Guide. 

GR and ES became bound by PCT Chapter II on 7 September 1996 and 6 September 1997, respectively, and may, therefore, 
be elected in a demand or a later election filed on or after 7 September 1 996 and 6 September 1 997, respectively, regardless of 
the filing date of the international application. (See second paragraph above.) 

Note that only an applicant who is a national or resident of a PCT Contracting State which is bound by Chapter II has 
the right to file a demand for international preliminary examination. 

CONFIRMATION OF PRECAUTIONARY DESIGNATIONS 

This notification lists only specific designations made under Rule 4.9(a) in the request. It is important to check that these 
designations are correct. Errors in designations.can be corrected where precautionary designations have been made under 
Rule 4.9(b). The applicant is hereby reminded that any precautionary designations may be confirmed according to Rule 4.9(c) 
before the expiration of 1 5 months from the priority date. If it is not confirmed, it will automatically be regarded as withdrawn 
by the applicant There will be no reminder and no invitation. Confirmation of a designation consists of the filing of a notice 
specifying the designated State concerned (with an indication of the kind of protection or treatment desired) and the payment 
of the designation and confirmation fees. Confirmation must reach the receiving Office within the 15-month time limit. 



REQUIREMENTS REGARDING PRIORITY DOCUMENTS 

For applicants who have not yet complied with the requirements regarding priority documents, the following is recalled. 

Where the priority of an earlier national, regional or international application is claimed, the applicant must submit a copy 
of the said earlier application, certified by the authority with which it was filed ("the priority document") to the receiving Office 
(which will transmit it to the International Bureau) or directly to the International Bureau, before the expiration of 16 months frorr 
the priority date, provided that any such priority document may still be submitted to the International Bureau before that date of 
international publication of the international application, in which case that document will be considered to have been received 
by the International Bureau on the last day of the 16-month time limit (Rule 17.1(a)). 

Where the priority document is issued by the receiving Office, the applicant may, instead of submitting the priority 
document, request the receiving Office to prepare and transmit the priority document to the International Bureau. Such request 
must be made before the expiration of the 16-month time limit and may be subjected by the receiving Office to the payment 
of a fee (Rule 1 7.1 (b)). 

If the priority document concerned is not submitted to the International Bureau or if the request to the receiving Office 
to prepare and transmit the priority document has not been made (and the corresponding fee, if any, paid) within theUpplicable 
time limit indicated under the preceding paragraphs, any designated State may disregard the priority claim, provided that no 
designated Office may disregard the priority claim concerned before giving the applicant an opportunity to furnish the priority 
document within a time limit which is reasonable under the circumstances. 

Where several priorities are claimed, the priority date to be considered for the purposes of computing the 16-month time 
limit is the filing date of the earliest application whose priority is claimed. 
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to the following designated Offices on the date indicated above as the date of mailing of this Notice: 

AG,AU,DZ,KP,KR,MZ,US 

In accordance with Rule 47.1 (c), third sentence, those Offices will accept the present Notice as conclusive evidence that 
the communication of the international application has duly taken place on the date of mailing indicated above and no copy 
of the international application is required to be furnished by the applicant to the designated Office(s). 

2. The following designated Offices have waived the requirement for such a communication at this time: 

AE^AM^AT^BA^B^CBR^Y^A^H^N^R^U^DE^K^M^A^E^P^S^KGB^D, 
GE^H^M^R^UJDJUINJSJ^KE^CKZ^CLK^R^LT^U^^M^MaMCMK^MN^W^X, 

NO,NZ,OA,PL,PT,RO,RU,SD,SE,SG,SI,SK,SL,TJ,TM,TR,^^ . . 

The corfimunication will be made to those Offices only upon their request. Furthermore, those Offices do not require the 
applicant to furnish a copy of the international application (Rule 49.1 (a-bis)). 

3. Enclosed with this Notice is a copy of the international application as published by the International Bureau on 
14 December 2000 (14.12.00) under No. WO 00/75338 

REMINDER REGARDING CHAPTER II (Article 31(2)(a) and Rule 54.2) 

If the applicant wishes to postpone entry into the national phase until 30 months (or later in some Offices) from the priority 
date, a demand for international preliminary examination must be filed with the competent International Preliminary 
Examining Authority before the expiration of 1 9 months from the priority date. 

It is the applicant's sole responsibility to monitor the 1 9-month time limit. 

Note that only an applicant who is a national or resident of a PCT Contracting State which is bound by Chapter II has the 
right to file a demand for international preliminary examination. 

REMINDER REGARDING ENTRY INTO THE NATIONAL PHASE (Article 22 or 39(1)) 

If the applicant wishes to proceed with the international application in the national phase, he must within 20 months 
or 30 months, or later in some Offices, perform the acts referred to therein before each designated or elected Office. 

For further important information on the time limits and acts to be performed for entering the national phase, see the 
Annex to Form PCT/IB/301 (Notification of Receipt of Record Copy) and Volume II of the PCT Applicant's Guide. 
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The International Bureau of WIPO 






34, chemin des Colombettes 


J. Zahra 




1211 Geneva 20, Switzerland 






Facsimile No. (41-22) 740.14.35 


Telephone No. (41-22) 338.83.38 




Form PCT/IB/308 (July 1996) 




3709975 





PCT 


International Application No. 




REQUEST 


International riling uate 




The undersigned requests that the present 

international application be processed 
according to the Patent Cooperation Treaty. 


Name of receiving Office and "PCT International Application" 




Applicant's or agent's file reference 

(if desired) (12 characters maximum) 2026-4302 PC 



Box No. I TITLE OF INVENTION 

CLONED GENOME OF INFECTIOUS HEPATITIS C VIRUS OF GENOTYPE 2a AND USES THEREOF , 



State (thai is, country) of nationality: 

U 0 


State (that is, country) of 


residence: 


US 


This person is applicant 1 1 all designated fyl all designated States except | | the United States 

for the purposes of: 1 1 States 1 A 1 the United States of America | | of America only 


1 1 the States indicated in 
| J the Supplemental Box 


Box No. ni FURTHER APPLICANT(S) AND/OR (FURTHER) INVENTOR(S) 





Box No. II APPLICANT 



Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the 
address indicated in this Box is the applicant s State (that is, country) of residence if no State 



of residence is indicated below.) 

The Government of the United States of America 

as represented by the Secretary, Department of 

Health and Human Services 

Office of Technology Transfer 

National Institutes of Health 

6011 Executive Boulevard, Suite 325 

Rockville, Maryland 20852 

US 



J I This person is also inventor. 



Telephone No. 




(301) 496- 


7056 


Facsimile No. 




(301) 402- 


0220 



Teleprinter No. 



Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the 
address indicated m this Box is die applicant s State (that is. country) of residence if no State 
of residence is indicated below.) 

Y AN AG I , Masayuki 

257 Congressional Lane, #402 

Rockville, Maryland 20852 

US 



This person is: 

I I applicant only 

I x I applicant and inventor 

I I inventor only (If this check-box 
1 1 jj marked, do not fill in below.) 



State (that is, country) of nationality: 



JP 



State (that is, country) of residence: 



US 



This person is applicant 
for the purposes of: 



|""""^| mil designated 



Sutes 



□ all designated States except 
the United Slates of America 



|""^"| the United States 



of America only 



□ the States indicated in 
the Supplemental Box 



[ X 1 Further applicants and/or (further) inventors are indicated on a continuation sheet. 



Box No. IV AGENT OR COMMON REPRESENTATIVE; OR ADDRESS FOR CORRESPONDENCE 



The person identified below is hereby/has been appointed to act on behalf 
of the applicant(s) before the competent International Authorities as: 



|X j agent | | common representative 



Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country.) 

FEILER, William S.; BORK, Richard W. and CHEN, Haiyan 

Morgan & Finnegan, L.L.P. - 

345 Park Avenue 

New York, New York 10154 

US 



Telephone No. 

(212) 758-4800 



Facsimile No. 

(212) 751-6849 



Teleprinter No. 



□ 



Address for correspondence: Mark this check-box where no agent or con»r ; io*!. representative is/has been appointed and the 
space above is used instead to indicate a special address to which correspondence should be sent. 



Form PCT/RO/101 (first sheet) (July 1998; reprint January 2000) 



See Notes to the request form 



Sheet No. 



Doc, 



No. 2026-4302PC 



Continuation of Box No. Ill FURTHER AFPUCANT(S) AND/OR (FURTHER) INVENTOR(S) 



If none of the following sub-boxes is used, this sheet should not be included in the request 



Name and address: (family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the 
address indicated in this Box is the applicant's State (that is, country) of residence if no State 
of residence is indicated below.) 



BUKH, Jens 

2018 Baltimore Road #J42 
Rockville, Maryland 20851 
US 



This person is: 

| | applicant only 

| X | applicant and inventor 

| | inventor only (If this check-box 
*—* is marked, do not fill in below.) 



State (that is. country) of nationality: 

DK 


State (that is, country) of 


residence: 

US 


This person is applicant 1 1 ill designated f— I all designated States except f— i the 

for the purposes of: 1 1 States 1 1 the United States of America |X | 0 f 


United States |— I the States indicated in 
America only 1 1 the Supplemental Box 



Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the 
address indicated in this Box is the applicant s State (that is. country) of residence if no State 
of residence is indicated below.) 



EMERSON, Suzanne U. 
4517 Everett Street 
Kensington, Maryland 20895 
US 



This person is: 

| | applicant only 

| X | applicant and inventor 

I 1 inventor only (If this check-box 
is marked, do not fill in below.) 



State (that is, country) of nationality: 

US 


State (that is, country) of residence: 


US 


This person is applicant | 1 a II designated 

for the purposes of: | | States 


| 1 all designated Slates except rrrn the United States 
1 1 the United States of America I A I of America only 


i 1 the States indicated in 
1 1 the Supplemental Box 



Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the 
address indicated in this Box is the applicant s State (that is, country) of residence if no State 
of residence is indicated below.) 



PURCELL, Robert H. 
17517 White Ground Road 
Boyds, Maryland 20841 
US 



State (that is, country) of nationality: 



US 



This person is: 

| | applicant only 

| X | applicant and inventor 

I I inventor only (If this check-box 
' is marked, do not fill in below.) 



State (that is, country) of residence: 



US 



This person is applicant 
for the purposes of: 



□ all designated I | all designated States except 
States I I the United Slates of America 



the United States 
of America only 



□ the States indicated in 
the Supplemental Box 



Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the 
address indicated in this Box is the applicant's State (that is. country) of residence if no State 
of residence is indicated below.) 


This person is: 

| | applicant only 

| | applicant and inventor 

| | inventor only (If this check-box 
is marked, do not fill in below.) 


State (that is, country) of nationality: 


State (that is, country) of residence: 



This person is applicant 
for the purposes of; 



□ all designated ( 1 all designated States except 
States | I the United Slates of America 



□ the United States 
of America only 



□ the States indicated in 
the Supplemental Box 



| Further applicants and/or (further) inventors are indicated on another continuation sheet. 



Form PCT/RO/101 (continuation sheet) (July 1998; reprint January 2000) 



See Notes to the request form 



Boi No.V DESIGNATION OF STATES 



Sheet No. 3 Docket 



2026-4302PC 



The following designations are hereby made under Rule 4.9(a) (mark the applicable check-boxes; at least one must be marked): 
Regional Patent 

H AP ARIPO Patent GH Ghana, GM Gambia, KE Kenya, LS Leaotho, MW Malawi, SD Sudan, SL Sierra Leone, SZ Swaziland, 
TZ United Republic of Tanzania, UG Uganda, ZW Zimbabwe, and any other State which is a Contracting State of the Harare 
Protocol and of the PCT 

ED EA Eurasian Patent: AM Armenia, AZ Azerbaijan, BY Belarus, KG Kyrgyzstan, KZ Kazakhstan, MD Republic of Moldova, 
RU Russian Federation, TJ Tajikistan, TM Turkmenistan, and any other State which is a Contracting State of the Eurasian Patent 
Convention and of the PCT 

13 EP European Patent: AT Austria, BE Belgium, CH and LI Switzerland and Liechtenstein, CY Cyprus, DE Germany, 
DK Denmark, ES Spain, FI Finland, FR France, GB United Kingdom, GR Greece, IE Ireland, IT Italy, LU Luxembourg, 
MC Monaco, NL Netherlands, PT Portugal, SE Sweden, and any other State which is a Contracting State of the European Patent 
Convention and of the PCT 

13 OA OAPI Patent: BF Burkina Faso, BJ Benin, CF Central African Republic, CG Congo, CI Cote d'lvoire, CM Cameroon, 
GA Gabon, GN Guinea, GW Guinea-Bissau, ML Maii, MR Mauritania, NE Niger, SN Senegal, TD Chad, TG Togo, and any 
other State which is a member State of OAPI and a Contracting State of the PCT (if other land of protection or treatment desired, 
specify on dotted line) 

National Patent (if other land of protection or treatment desired, specify on doited ImeJ: 

[3 AE United Arab Emirates m 

13 AL Albania gg 

[3 AM Armenia gt] lj 

[3 AT Austria [xj LU 

[3 AU Australia Jx| lv 

fxl AZ Azerbaijan gc] ma 

H BA Bosnia and Herzegovina g] 

H BB Barbados g) JVIG 

0 BG Bulgaria Jx] MK 

[3 BR Brazil 

H BY Belarus El MN Mongolia 

□ CA Canada [X) MW Malawi 



Liberia 

Lesotho 

Lithuania 

Luxembourg 
Latvia 

Morocco 

Republic of Moldova 

Madagascar 

The former Yugoslav 



Republic of Macedonia 



D3 CH and LI Switzerland and Liechtenstein 



E CN 
□ CR 
13 CU 

a cz 

0 DE 



H MX 



China [x] NO 

Costa Rica S NZ 

Cuba El PL 



Czech Republic [x] p-r 

Germany H RO 

H DK Denmark S RU 

fxl DM Dominica jx] grj 

13 EE Estonia SE 

S ES Spain H SG 

S FI Finland g] SI 

[3 CB United Kingdom gj sK 

[3 GD Grenada [xj SL 

H GE Georgia [x] TJ 

H GH Ghana El TM 

[3 GM Gambia El TR 



13 HR Croatia El TT 

13 HU Hungary ElTZ 

(3 ID Indonesia H UA 

13 IL Israel El UG 

[3 IN India El US 

13 IS Iceland 

13 JP Japan EI UZ 

[3 KE Kenya 0 VN 

H KG Kyrgyzstan ElYU 

09 KP Democratic People's Republic of Korea .... EI ZA 

S ZW 

133 KR Republic of Korea Check-boxes reserved for designating States which have 

S KZ Kazakhstan become party to the PCT after issuance of this sheet: 

13 lc Saint Lucia El .DZ. P.QopX.e.'.s . Rep.ubl ie. .qf . Algeria 

13 LK Sri Lanka E3_ - AG - Antigua .and. Barbuda . 



Mexico 

Norway 

New Zealand 

Poland 

Portugal 

Romania 

Russian Federation 

Sudan 

Sweden 

Singapore 

Slovenia 

Slovakia 

Sierra Leone 

Tajikistan 

Turkmenistan 

Turkey 

Trinidad and Tobago 

United Republic of Tanzania 

Ukraine 

Uganda 

United States of America . . 

continuation 

Uzbekistan 

Viet Nam 

Yugoslavia 

South Africa 

Zimbabwe 



Precautionary Designation Statement: In addition to the designators male aoove 



^{fieap^cant also makes under Rule 4.9(b) all other 
designations which would be permitted under the PCT except any dcsignation(s) indicated in the Supplemental Box as being excluded 
from the scope of this statement. The applicant declares that those additional designations are subject to confirmation and that any 
designation which is not confirmed before the expiration of 15 months from the priority date is to be regarded as withdrawn by the applicant 
at the expiration of that time limit (Confirmation (includmgjees) must reach the receiving Office within die 15-mondx time limit) 



Form PCT/RO/101 (second sheet) (January 2000) 



See Notes to the request form 



Sheet No. 4 Docket 



Filing date 
of earlier application 
(day/mon th/year) 


Number 
of earlier application 


Where earlier application is: 


national application: 
country 


regional application:* 
regional Office 


international application: 
receiving Office 


item(l) 

04 June 1999 
(04.06.99) 


60/137,693 


US 






item (2) 










item (3) 











Box No. VI PRIORITY CLAIM 



026-4302PC 



□ Further priority claims are indicated in the Supplemental Box 



| x| The receiving Office is requested to prepare and transmit to the International Bureau a certified copy 
— of the earlier application(s) (only if the earlier application was filed with the Office which for the 

purposes of the present international application is the receiving Office) identified above as item(s): ( 1 ) 



* Where the earlier application is an AR1PO application, it is mandatory to indicate in the Supplemental Box at least one country party to the Pans 
Convention for the Protection of Industrial Properly for which that earlier application was filad (Rule 4J0(b)(nJ). See Supplemental Box, 



Box No. Vn INTERNATIONAL SEARCHING AUTHORITY 



Choice of International Searching Authority (ISA) 

(if two or more International Searching Authorities are 
competent to carry out the international search, indicate 
the Authority chosen; the two-letter code may be used) : 

ISA/ep 



Request to use results of earlier search; reference to that search (if an earlier 
search has been carried out by or requested from the International Searching Authority) : 



Date (day'monthyear) 



Number 



Country for regional Office) 



Box No. VIP CHECK LIST; LANGUAGE OF FILING 



This international application contains 
the following number of sheets: 

request 5 



description (excluding 
sequence listing part) 

claims 

abstract 

drawings 

sequence listing part 
of description 



54 
5 
1 

21 

84 



This international application is accompanied by the item(s) marked below; 
1.0 f cc calculation sheet 

2. [x] separate signed power of attorney (Unsigned) 

3. [5g copy of general power of attorney; reference number, if any: 

4. □ statement explaining lack of signature 

5. □ priority document(s) identified in Box No. VI as item(s): 

6. □ translation of international application into (language): 

7. Q separate indications concerning deposited microorganism or other biological material 

8. f3 nucleotide and/or amino acid sequence listing in computer readable form 



Figure of the drawings which 

should accompany the abstract: Fig. 1 


1 ^ Di-cuiacira si. ZD; Transmittal better 
Language of filing of the 
international application: English 


Box No. DC SIGNATURE OF APPLICANT OR AGENT 



Next to each signature, indicate the name of the person signing and the capacity in which the person signs (tf such capacity is not obvious from reading the request). 




William S 
Agent for 



F/exler 
pplicants 



1. 


Date of actual receipt of the purported 
international application: 


For rece 




2. Drawings: 


3. 


Corrected date of actual receipt due to later but 
timely received papers or drawings completing 
the purported international application: 






| | received: 


4. 


Date of timely receipt of the required 
corrections under PCT Article 11(2): 






| | not received: 


5. 


International Searching Authority . . 
(if two or more are competent): lo A / 




6. 1 I Transmittal of search copy delayed 
1 1 until search fee is paid. 





Date of receipt of the record copy 
by the International Bureau: 



For International Bureau use only , 



Form PC77RO/101 (last sheet) (July 1998; reprint January 2000) 



See Notes to the request form 



Docket 



Sheet No. 



26-4302PC 



Supplemental Box If the Supplemental Box is not used, this sheet should not be included in the request 



r M' *? *F ° ftk lf 5 ^? J ' i A ', Vfl 55* in i u fficient to furnish all the information: in such case, write "Continuation of Bar V« 

l^tethen^ 



the space was insufficient, in particular: 



(0 



V^fiLtnTi^ZZSZl'i^^ write 
ConlmuationoJBoxNo.lll and indicate for each additional person the same type of informational reauired in B^No^lh^ni 
country of the address indicated in this Box it the annlimm ; ft!,,. /•#*„. t. '-^ofres' " ™ - m box No. 111. The 



. „ .r.u.iu.cyvr eocn aaamonai person me same type of information as reauired in Bar Nn til tu. 



(Hi) if in Box No. 11 or in any of the sub-boxes ofBoxNo. 111. the Inventor or the Inventor/applicant Is not Inventor for the B ur»»*~ 

fiv) if. in addition to theagent(s) indicated in Box No. IV. there are further agents: insuch case write "Continuation ofRnr N n tv 
and indicate for each further agent the same type of information as required in Box No IV; t -°"" nua "°" of Box No. IV 

(v) if. inBoxNo.V. the name ofany State (or OAPIJ is accompanied by the indication "patent of addition " or "certificate of additi^ 
or if. in Box No. V thenameofthe United States of America is accompanied by an indication "eoMbSua^^oT'cfnOmS^ 
m-part : in such can write "Continuation of Box No. V" and the name of each State invol V ed%rOAWanda^ 
7the7are^pXoUo7 "* '»>' or parent application and the date^a^^ 

(vij if '"Box No VI there are more than three earlier applications whose priority is claimed- in such case write "Cnnti^,,^ ~f 
Box No. VI and indicate for each additional earlier application the sLe type of informatio, ,Tre£1redin BoT&vT; ' 

(viij if in Box No VI the earlier application is an AR1PO application: in such case, write "Continuation of Box No VI" SD ecifi,,h„ 

2 If with regard to the precautionary designation statement contained in Box No. V, the applicant wishes to exclude am, SW/V) 



of novelty" and furnish that statement below. 



. ;- j j r — w.v^. VJ Muuu/iuj *urr Luncermng non-prey uataal 

write Statement concerning non-prejudicial disclosures or exceptions to lack 



Continuation of Box No. V - Designation of States 



US United States of America 



- Continuation of US Provisional Application 
Serial No. 60/137,693, filed 04 June 1999 

(04.06.99) 



Form PCT/RO/10I (supplemental sheet) (July 1998; reprint July 1999) 



See Notes to the request form 



This sheet is not pc^>fand does not count as a sheet of the internamnal application 



PCT 



FEE CALCULATION SHEET 
Annex to the Request 



fi. P e P Sncc 0ragent,, 2026-4302PC 



For receiving Office use only 



International application No. 



Date stamp of the receiving Office 



Applicant 

The Government of the United States of America as represented by the 
Secretary, Department of Health and Human Services, et al. 



CALCULATION OF PRESCRIBED FEES 

1. TRANSMITTAL FEE 



240.00 



EP 



2. SEARCH FEE 

International search to be carried out by 

(If two or more International Searching Authorities are competent in relation to the international 
application, indicate the name of the Authority which is chosen to carry out the international search.) 

3. INTERNATIONAL FEE 



990.00 



Basic Fee 

The international application contains 170 



sheets. 



140 



$10.00 



remaining sheets 



additional amount 



| $ 427. 


00 


bl 


$1,400. 


00 


b2 



Add amounts entered at bl and b2and enter total at B 



| $1,827.00 



B 



Designation Fees 

The international application contains ^5 

8 



number of designation fees 
payable (maximum 8) 



designations. 
$92.00 = | $ 



736. 00 



D 



amount of designation fee 
Add amounts entered at B and Dand enter total at I . . . 



$2,563.00 



(Applicants from certain States are entitled to a reduction of 75% of die 
international fee. Were the applicant is (or all applicants are) so entitled, the 
total to be entered at! is 25% of the sum of the amounts entered at B and D.) 

4. FEE FOR PRIORITY DOCUMENT (if applicable) 



5. TOTAL FEES PAYABLE 

Add amounts entered at T, S, I and P,and enter total in the TOTAL box 



1* 


15.00 


P 




$3 


,808.00 




TOTAL 



| I The designation fees are not paid at this time. 



MODE OF PAYMENT 

□ authorization to charge 
deposit account (see below) 

1 X| cheque 

[ [ postal money order 



| [ bank draft 
| | cash 

| [ revenue stamps 



| | coupons 

| | other (speci/y): 



DEPOSIT ACCOUNT AUTHORIZATION (this mode of payment may not be available at all receivin g Offices) 
The RO/ US [ j is hereby authorized to charge the total fees indicated above to my deposit account. 

I X | (this check- box may be marked only if the conditions for deposit accounts of the receiving Office so permit) is 
L_l hereby authorized to charge any deficiency or credit any overpayment in the total fees indicated above to my 
deposit account 



13-4500 



Deposit Account No. 



mis hereby authorized to charge the fee for preparation and transmittal of the priority document to the International 
Bureau of WIPO to my deposit account. / i x)/J # /f 

Signature^ Willia^ S. Feiler 



> my deposit \ 
02 June 2000 
Date (day/month'year) 



Form PCT/RO/101 (Annex) (January 2000) 



See Notes to the fee calculation sheet 



(12) INTERNATIONAL APPLICATION PI BUSHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
14 December 2000 (14.12.2000) 




PCT 



(10) International Publication Number 

WO 00/75338 A3 



(51) International Patent Classification 7 : 

C07K 14/18. 16/18. A61 K 38/00. 39/00 



C12N 15/51. 



(21) International Application Number: PCT/U S0O/ 1 5446 

(22) International Filing Date: 2 June 2000 (02.O6.20CX)) 



(25) Filing Language: 

(26) Publication Language: 

(30) Priority Data: 

60/137.693 



English 
English 

4 June 1999 (04.06.1999) US 



(63) Related by continuation (CON) or continuation-in-part 
(C1P) to earlier application: 

US 60/137.693 (CON) 

Filed on 4 June 1999 (04.06. 1999) 

(71) Applicant (for all designated States except US): THE 
GOVERNMENT OF THE UNITED STATES OF 
AMERICA as represented by THE SECRETARY, 
DEPARTMENT OF HEALTH AND HUMAN SER- 
VICES [US/US]: Office of Technology Transfer. National 
Institutes of Health, Suite 325, 601 1 Executive Boulevard, 
Rockville, Ml) 20852 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): YANAGI, Masayuki 
[JP/US]; 257 Congressional Lane, #402, Rockville, MD 
20852 (US). BUKH, Jens [DK/US]; 2018 Baltimore Road 



#J42. Rockville. MD 20851 (US). EMERSON, Suzanne, 
U. [US/US]: 4517 Everett Street. Kensington. MD 20895 
(US). PURCELL, Robert, H. [US/US]: 17517 White 
Ground Road, Boyds, MD 20841 (US). 

(74) Agents: FEILER, William, S. et al.: Morgan & Finnegan. 
L L.P, 345 Park Avenue. New York. NY 10154 (US). 

(81) Designated States (national j: AE. AG. AL, AM, AT. AU, 
AZ. BA. BB. BG, BR. BY. CA, CH, CN, CR. CU, CZ. DE, 
DK. DM, DZ. EE. ES. Fl. GB. GD. GE, GH, GM. HR, HU. 
ID. IL, IN. IS, JP. KE. KG, KP. KR, KZ. LC. LK. LR, LS. 
LT. LU. LV, MA, MD. MG. MK. MN, MW. MX, MZ. NO, 
NZ. PL. PT. RO, RU, SD, SE. SG, Si, SK. SL, TJ, TM. TR, 
TT, TZ. UA. UG. US. UZ, VN. YU, ZA. ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE. LS, MW. MZ. SD. SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM. AZ, BY, KG, KZ, MD. RU, TJ. TM ), European 
patent (AT, BE. CH, CY. DE, DK. ES. FL FR, GB, GR, IE, 
IT, LU, MC. NL, PT. SE), OAPI patent (BE BJ, CF, CG. 
CI. CM. GA, GN. GW, ML. MR. NE, SN. TD. TG ). 

Published: 

with international search report 

(88) Date of publication of the international search report: 

1 November 2001 

For mo-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



(54) Title: CLONED GENONE OF INFECTIOUS HEPATITIS C VIRUS OF GENOTYPE 2a AND USES THEREOF 
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5'UTR 
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Title Of Invention 

Cloned Genome Of Infectious 
Hepatitis C Virus of Genotype 2a And Uses Thereof 

Field Of Invention 

The present invention relates to molecular 
approaches to the production^x>f nucleic acid sequence 
which comprises the genome of infectious hepatitis C 
virus. In particular, the invention provides a nucleic 
acid sequence which comprises the genome of an 
ijnfectious hepatitis C virus of genotype 2a. The 
invention therefore relates to the use of the nucleic 
acid sequence and polypeptides encoded by all or part of 
15 the sequence in the development of vaccines and 

diagnostic assays for HCV and in the development of 
screening assays for the identification of antiviral 
agents for HCV. 

20 Background Of Invention 

Hepatitis C virus (HCV) has a positive-sense 
single-strand RNA genome and is a member of the genus 
Hepacivirus within the Flaviviridae family of viruses 
25 (Rice, 1996) . As for all positive-stranded RNA viruses, 

the genome of HCV functions as mRNA from which all viral 
proteins necessary for propagation are translated. 

The viral genome of HCV is approximately 9600 
nucleotides (nts) in length and consists of a highly 

30 

conserved 5' untranslated region (UTR) , a single long 
open reading frame (ORF) of approximately 9, 000 nts and 
a complex 3' UTR. The 5' UTR contains an internal 
ribosomal entry site (Tsukiyama-Kohara et al., 1992; 

35 
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Honda et al., 1996). The 3' UTR consists of a short 
variable region, a polypyrimidine tract of variable 
length and, at the 3' end, a highly conserved region of 
approximately 100 nucleotides (Kolykhalov et al., 1996; 
Tanaka et al., 1995; Tanaka et al., 1996; Yamada et al., 
1996) . The last 46 nucleotides of this conserved region 
were predicted to form a stable stem-loop structure 
thought to be critical for viral replication (Blight and 
Rice, 1997; Ito and Lai, 1997; Tsuchihara et al., 1997). 
The ORF encodes a large polypeptide precursor that is 
cleaved into at least 10 proteins by host and viral 
proteinases (Rice, 1996) . The predicted envelope 
proteins contain several conserved N-linked 
glycosylat ion sites and cysteine residues (Okamoto et 
al., 1992a). The NS3 gene encodes a serine protease and 
an RNA helicase and the NS5B gene encodes an RNA- 
dependent RNA polymerase. 

A remarkable characteristic of HCV is its 
genetic heterogeneity, which is manifested throughout 
the genome (Bukh et al., 1995). The most heterogeneous 
regions of the genome are found in the envelope genes, 
in particular the hypervariable region 1 (HVR1) at the 
N-terminus of E2 (Hijikata et al., 1991; Weiner et ai., 
1991) . HCV circulates as a quasispecies of closely 
related genomes in an infected individual. Globally, 
six major HCV genotypes (genotypes 1-6) and multiple 
subtypes (a, b, c, etc.) have been identified (Bukh et 
al., 1993; Simmonds et al., 1993). 

The nucleotide and deduced amino acid 
sequences among isolates within a quasispecies generally 
differ by < 2%, whereas those between isolates of 
different genotypes vary by as much as 35%. Genotypes 
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1, 2 and 3 are found worldwide and constitute more than 
90% of the HCV infections in North and South America, 
Europe, Russia, China, Japan and Australia (Forns and 
Bukh, 1998) . Throughout these regions genotype 1 
5 accounts for the majority of HCV infections but 
genotypes 2 and 3 each account for 5-15%. 

At present, more than 80% of individuals 
infected with HCV become chronically infected and these 
chronically infected individuals have a relatively high 

10 

risk of developing chronic hepatitis, liver cirrhosis 
and hepatocellular carcinoma (Hoofnagle, 1997) . The 
only effective therapy for chronic hepatitis C, 
interferon (IFN), alone or in combination with 
15 ribavirin, induces a sustained response in less than 50% 
of treated patients (Davis et al., 1998; McHutchinson et 
al., 1998). Consequently, HCV is currently the most 
common cause of end stage liver failure and the reason 
for about 30% of liver transplants performed in the U.S. 

20 

(Hoofnagle, 1997). In addition, a number of recent 
studies suggested that the severity of liver disease and 
the outcome of therapy may be genotype-dependent 
(reviewed in Bukh et al., 1997). In particular, these 
25 studies suggested that infection with HCV genotype lb 

was associated with more severe liver disease (Brechot, 
1997) and a poorer response to IFN therapy (Fried and 
Hoofnagle, 1995) . As a result of the inability to 
develop a universally effective therapy against HCV 
infection, it is estimated that there are still more 
than 25,000 new infections yearly in the U.S. (Alter 
1997) Moreover, since there is no vaccine for HCV, HCV 
remains a serious public health problem. 
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Despite the intense interest in the 
development of vaccines and therapies for HCV, progress 
has been hindered by the absence of a useful cell 
culture system and the lack of any small animal model 
for laboratory study. For example, while replication of 
HCV in several cell lines has been reported, such 
observations have turned out^not to be highly 

■v /-\ *-\ v~ ^\ /~i n r< l W T T K~» /-I /-J A +- A T"i +- V\ /—* V"> A «nrN -r-\ —r /— i <— k A «r-< 4- V\ r-N >""\ r~\ ~] \ r 
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animal model, other than man, for this disease. 
Consequently, HCV has been studied only by using 
clinical materials obtained from patients or 
experimentally infected chimpanzees, an animal model 
whose availability is very limited. 
15 However, several researchers have recently 

reported the construction of infectious cDNA clones of 
HCV, the identification of which would permit a more 
effective search for susceptible cell lines and 
facilitate molecular analysis of the viral genes and 
their function. For example, Yoo et al., and Dash et 
al., (1997) (1995) reported that RNA transcripts from 
cDNA clones of HCV-1 (genotype la) and HCV-N (genotype 
lb) , respectively, resulted in viral replication after 
25 transfection into human hepatoma cell lines. 

Unfortunately, the viability of these clones was not 
tested in vivo and concerns were raised about the 
infectivity of these cDNA clones iri vitro (Fausto, 
1997). In addition, both clones did not contain the 
terminal 98 conserved nucleotides at the very 3' end of 
the UTR. 

Kolykhalov et al., (1997) and Yanagi et al. 
(1997, 1998) reported the derivation from HCV strains 
H77 (genotype la) and HC-J4 (genotype lb) of cDNA clones 
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of HCV that are infectious for chimpanzees. However, 
while these infectious clones will aid in studying HCV 
replication and pathogenesis and will provide an 
important tool for development of in vitro replication 

5 and propagation systems, it is important to have 

infectious clones of more than one genotype, given the 
extensive genetic heterogeneity of HCV and the potential 
impact of such heterogeneity on the development of 
effective therapies and vaccines for HCV. 

^ In addition, synthetic chimeric viruses can be 

used to map the functional regions of viruses with 
different phenotypes. In flaviviruses and pestiviruses, 
infectious chimeric viruses have been successfully 

15 engineered to express different functional units of 
related viruses (Bray and Lai, 1991; Pletnev et aJ., 
1992, 1998; Vassilev et ai., 1997) and in some cases it 
has been possible to make chimeras between non-related 
or distantly related viruses. For instance, the IRES 

20 

element of poliovirus or bovine viral diarrhea virus has 
been replaced with IRES sequences from HCV (Frolov et 
al., 1998; Lu and Wimmer, 1996; Zhao et ai., 1999). 
Recently, the construction of an infectious chimera of 
25 two closely related HCV subtypes has been reported. The 
chimera contained the complete ORF of a genotype lb 
strain but had the 5' and 3' termini of a genotype la 
strain (Yanagi et al., 1998). 

It is important to determine whether chimeras 

30 

constructed from more divergent HCV strains are 
infectious because such chimeras could be used to define 
the functions of viral units and to dissect the immune 
response. 

35 
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Summary Of The Invention 

The present invention relates to nucleic acid 
sequence which comprises the genome of infectious 
hepatitis C virus and in particular, nucleic acid 

5 sequence which comprises the genome of infectious 

hepatitis C virus of genotype 2a. It is therefore an 
object of the invention to provide nucleic acid sequence 
which encodes infectious hepatitis C virus. Such 
nucleic acid sequence is referred to throughout the 
application as "infectious nucleic acid sequence". 

For the purposes of this application, nucleic 
acid sequence refers to RNA, DNA, cDNA or any variant 
thereof capable of directing host organism synthesis of 

5 a hepatitis C virus polypeptide. It is understood that 
nucleic acid sequence encompasses nucleic acid 
sequences, which due to degeneracy, encode the same 
polypeptide sequence as the nucleic acid sequences 

q described herein. 

The invention also relates to the use of the 
infectious nucleic acid sequences to produce chimeric 
genomes consisting of portions of the open reading 
frames of nucleic acid sequences of other genotypes 

^ (including, but not limited to, genotypes 1, 2, 3, 4, 5 

and 6) and subtypes (including, but not limited to, 
subtypes la, lb, 2a, 2b, 2c, 3a, 4a-4f, 5a and 6a) of 
HCV. For example, infectious nucleic acid sequence of 

0 the 2a strain HC-J6, described herein can be used to 
produce chimeras with sequences from the genomes of 
other strains of HCV from different genotypes or 
subtypes. Nucleic acid sequences which comprise 
sequences from two or more HCV genotypes or subtypes are 
designated "chimeric nucleic acid sequences". 
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The invention further relates to mutations of 
the infectious nucleic acid sequence of the invention 
where mutation includes, but is not limited to, point 
mutations, deletions and insertions. In one embodiment, 
a gene or fragment thereof can be deleted to determine 
the effect of the deleted gene or genes on the 
properties of the encoded virus such as its virulence 
and its. ability to replicate. In an alternative 
embodiment, a mutation may be introduced into the 
infectious nucleic acid sequences to examine the effect 
of the mutation on the properties of the virus. 

The invention also relates to the introduction 
of mutations or deletions into the infectious nucleic 
15 acid sequence in order to produce an attenuated 

hepatitis C virus suitable for vaccine development. 

The invention further relates to the use of 
the infectious nucleic acid sequence to produce 
attenuated viruses via passage in vitro or in vivo of 
the viruses produced by transfection of a host cell with 
the infectious nucleic acid sequence. 

The present invention also relates to the use 
of the nucleic acid sequence of the invention or 
25 fragments thereof in the production of polypeptides 

where "nucleic acid sequence of the invention" refers to 
infectious nucleic acid sequence, mutations of 
infectious nucleic acid sequence, chimeric nucleic acid 
sequence and sequences which comprise the genome of 
attenuated viruses produced from the infectious nucleic 
acid sequence of the invention. In one embodiment, said 
polypeptide or polypeptides are fully or partially 
purified from hepatitis C virus produced by cells 
35 transfected with nucleic acid sequence of the invention. 
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In another embodiment, the polypeptide or polypeptides 
are produced recombinantly from a fragment of the 
nucleic acid sequences of the invention. In yet another 
embodiment, the polypeptides are chemically synthesized. 

The polypeptides of the invention, especially 
structural polypeptides, can serve as immunogens in the 
development of vaccines or as antigens in the 
development of diagnostic assays for detecting the 
presence of HCV in biological samples. 

The invention therefore also relates to 
vaccines for use in immunizing mammals especially humans 
against hepatitis C. In one embodiment, the vaccine 
comprises one or more polypeptides made from the nucleic 
15 acid sequence of the invention or fragment thereof. In 

a second embodiment, the vaccine comprises a hepatitis C 
virus produced by transfection of host cells with the 
nucleic acid sequences of the invention. 

The present invention therefore relates to 
methods for preventing hepatitis C in a mammal. In one 
embodiment the method comprises administering to a 
mammal a polypeptide or polypeptides encoded by the 
nucleic acid sequence of the invention in an amount 
25 effective to induce protective immunity to hepatitis C- 

In another embodiment, the method of prevention 
comprises administering to a mammal a hepatitis C virus 
of the invention in an amount effective to induce 
protective immunity against hepatitis C. 

In yet another embodiment, the method of 
protection comprises administering to a mammal the 
nucleic acid sequence of the invention or a fragment 
thereof in an amount effective to induce protective 
35 immunity against hepatitis C. 
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The invention also relates to hepatitis C 
viruses produced by host cells transfected with the 
nucleic acid sequence of the present invention. 

The invention therefore also provides 
5 pharmaceutical compositions comprising the nucleic acid 
sequence of the invention and/or the encoded hepatitis C 
viruses. The invention further provides pharmaceutical 
compositions comprising polypeptides encoded by the 
nucleic acid sequence of the invention or fragments 
10 thereof. The pharmaceutical compositions of the 
invention may be used prophylactically or 
therapeutically . 

The invention also relates to antibodies to 
15 the hepatitis C virus of the invention or their encoded 
polypeptides and to pharmaceutical compositions 
comprising these antibodies . 

The invention also relates to the use of the 
nucleic acid sequences of the invention to identify cell 

20 

lines capable of supporting the replication of HCV in 
vitro . 

The invention further relates to the use of 
the nucleic acid sequences of the invention or their 
25 encoded viral enzymes (e.g. NS3 serine protease, NS3 
helicase, NS5B RNA polymerase) to develop screening 
assays to identify antiviral agents for HCV. 

Brief Description Of Figures 

30 

Figure 1 shows the amplification and cloning 
of hepatitis C virus genotype 2a (strain HC- J6 ch) . The 
nucleotide positions correspond to the sequence of 
' PJ6CF, a full length cDNA clone of hepatitis C virus, 
35 genotype 2a, strain HC-J6 CH - Products from polymerase 
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chain reaction are also shown. The names of the clones 
obtained from these products are indicated (number of 
clones sequenced are shown in parenthesis) . The 
composition of the full-length cDNA clone is shown at 
5 the bottom. The restriction enzymes used for cloning 

are indicated. An Xbal site in HC-J6 C h was eliminated by 
a silent substitution at position 5494. 

Figure 2 shows tree analysis of clones 
amplified from an infectious acute phase plasma pool 
generated in a chimpanzee inoculated with human plasma 
containing strain HC-J6 (Okamoto et al., 1991) as well 
as a tree of the predicted polyprotein sequence of 
HC-J6 C h and the infectious HC-J6 C h cDNA clone (pJ6CF) . 
The nucleotide positions with deletions or insertions 
were stripped in the analysis of the clones. Multiple 
sequence alignments and tree analyses were performed 
with GeneWorks (Oxford Molecular Group) (Bukh et al., 
1995) . Genotype designations are indicated. Other 
sequences included in the analysis are HC-J8 (Okamoto et 
al., 1992), genotype la infectious clone BEBE1 (Nakao et 
al., 1996), H77C (Yanagi et al., 1997); genotype lb 
infectious clone J4L6S (Yanagi et al., 1998). The scale 
in each tree indicates the calculated genetic distance. 

Figure 3 shows the alignment of the 
hypervariable region 1 sequences from 8 J6S clones of 
strain HC-J6 C h- HC-J6 C h represents the consensus amino 
acid sequence of the infectious plasma pool from an 
experimentally infected chimpanzee. HC-J6 is the 
published amino acid sequence of the original inoculum 
(Okamoto et al., 1991). 

Figure 4 shows the construction of four 
intertypic chimeric cDNA clones. White boxes are 
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sequences derived from genotype 2a clone pJ6CF, and 

o 

black boxes are sequences derived from genotype la clone 
pCV-H77C (Yanagi et al., 1997). An Ndel site (mutation 
at position 9158 of pCV-H77C) was eliminated and an 
artificial Ndel site (mutation at position 2765 of 
5 pCV-H77C) was created by site-directed mutagenesis; 
silent mutations are underlined. 

Figures 5A and 5B show the alignment of the 
nucleotide sequences of the 5' (Fig. 5A) and 3' UTRs 
(Fig. 5B) and the amino acid sequences of E2/p7/NS2 

10 

junctions (Fig, 5B) in the intertypic la, 2a chimeric 
cDNA clones. In the 5' UTR alignment, the first 39 nts 
of core believed to be important for the IRES function 
were included (Lemon and Honda, 1997). Top line: the 
15 sequence of the infectious genotype la clone pCV-H77C 

(Yanagi et al., 1997). Bottom line: the sequence of the 
infectious genotype 2a clone pJ6CF. Dot: identity with 
the sequence of H77C. Capital letter: different from the 
sequence of H77C. Dash: deletion. Bold face: initiation 

20 

or stop codon of the ORF. Underlined: Agel cleavage 
site. Arrow: putative sites in the HCV polyprotein 
cleaved by host signal peptidases. Numbering 
corresponds to the sequence of pCV-H77C. 

25 Figures 6A-6F show the nucleotide sequence of 

the infectious hepatitis C virus clone of genotype la 
strain H77C and Figures 6G-6H show the amino acid 
sequence encoded by the clone. 

2Q Figures 7A-7F show the nucleotide sequence of 

the infectious hepatitis C virus clone of genotype lb 
strain HC-J4 and Figures 7G-H show the amino acid 
sequence encoded by the clone. 

35 
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DESCRIPTION OF THE INVENTION 

The present invention relates to nucleic acid 
sequence which comprises the genome of an infectious 
hepatitis C virus. More specifically, the invention 
relates to nucleic acid sequence which encodes 
infectious hepatitis C virus of strain HC-J6 C h, genotype 
2a. The infectious nucleic -acid sequence of the 
invention is shown in SEQ ID NO;i and is contained in <* 
plasmid construct deposited with the American Type 
Culture Collection (ATCC) on May 28, 1999 and having 
ATCC accession number PTA-153. 

The invention also relates to "chimeric 
nucleic acid sequences" where the chimeric nucleic acid 
15 sequences consist of open-reading frame sequences and/or 

5' and/or 3' untranslated sequences taken from nucleic 
acid sequences of hepatitis C viruses of different 
genotypes or subtypes. 

In one embodiment, the chimeric nucleic acid 
sequence consists of sequence from the genome of 
infectious HCV of genotype 2a which encodes structural 
polypeptides and sequence from the genome of a HCV of a 
different genotype or subtype which encodes 
nonstructural polypeptides. 

Alternatively, the nonstructural region of 
infectious HCV of genotype 2a and structural region of a 
HCV of a different genotype or subtype may be combined. 
3q This will result in a chimeric nucleic acid sequence 

consisting of sequence from the genome of infectious HCV 
of genotype 2a which encodes nonstructural polypeptides 
and sequence from the genome of a HCV of a another 
genotype or subtype which encodes structural 
polypeptides. 
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Preferably, the nucleic acid sequence from the 
genome of the infectious HCV clone of genotype la 
(deposited with the ATCC on June 2, 1999 ; Figures 6A- 
6F) , or the nucleic acid sequence from the genome of the 
5 infectious HCV clone of genotype lb (ATCC accession 

number 209596; Figures 7A-7F) is used to construct the 
chimeric nucleic acid sequence with the HCV of genotype 
2a of the invention. 

It is believed that the construction of such 
chimeric nucleic acid sequences will be of importance in 
studying the growth and virulence properties of 
hepatitis C virus and in the production of candidate 
hepatitis C virus vaccines suitable to confer protection 
against multiple genotypes of HCV. For example, one 
might produce a "multivalent" vaccine by putting 
epitopes from several genotypes or subtypes into one 
clone. Alternatively one might replace just a single 
gene from an infectious sequence with the corresponding 
gene from the genomic sequence of a strain from another 
genotype or subtype or create a chimeric gene which 
contains portions of a gene from two genotypes or 
subtypes. Examples of genes which could be replaced or 
which could be made chimeric, include, but are not 
limited to, the El, E2 and NS4 genes. 

The invention further relates to mutations of 
the infectious nucleic acid sequences where "mutations" 
include, but are not limited to, point mutations, 
deletions and insertions. Of course, one of ordinary 
skill in the art would recognize that the size of the 
insertions would be limited by the ability of the 
resultant nucleic acid sequence to be properly packaged 
within the virion. Such mutations could be produced by 
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techniques known to those of skill in the art such as 
site-directed mutagenesis, fusion PCR, and restriction 
digestion followed by religation. 

In one embodiment, mutagenesis might be 
undertaken to determine sequences that are important for 
viral properties such as replication or virulence. For 
example, one may introduce ^mutation into the 

-! n <p in ^4--; I *-s.i-. n ^ , , l ^. A ~j — - ~ — . .u. «• — U. ~ i ^ • — — ±- ^ .i_ i_ ^ 
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cleavage site between the NS4A and NS4B polypeptides to 
examine the effects on viral replication and processing 
of the polypeptide . 

Alternatively, one may delete all or part of a 
gene or of the 5' or 3' nontranslated region contained in 
15 an infectious nucleic acid sequence and then transfect a 

host cell (animal or cell culture) with the mutated 
sequence and measure viral replication in the host by 
methods known in the art such as RT-PCR. Preferred 
genes include, but are not limited to, the P7, NS4B and 
NS5A genes. Of course, those of ordinary skill in the 
art will understand that deletion of part of a gene, 
preferably the central portion of the gene, may be 
preferable to deletion of the entire gene in order to 
conserve the cleavage site boundaries which exist 
between proteins in the HCV polyprotein and which are 
necessary for proper processing of the polyprotein. 

In the alternative, if the transfection is 
into a host animal such as a chimpanzee, one can monitor 
the virulence phenotype of the virus produced by 
transfection of the mutated infectious nucleic acid 
sequence by methods known in the art such as measurement 
of liver enzyme levels (alanine aminotransferase (ALT) 
or isocitrate dehydrogenase (ICD) ) or by histopathology 
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of liver biopsies. Thus, mutations of the infectious 
nucleic acid sequences may be useful in the production 
of attenuated HCV strains suitable for vaccine use. 

The invention also relates to the use of the 
5 infectious nucleic acid sequence of the present 

invention to produce attenuated viral strains via 
passage In vitro or in vivo pf the virus produced by 
transfection with the infectious nucleic acid sequence. 

The present invention therefore relates to the 

10 

use of the nucleic acid sequence of the invention to 
identify cell lines capable of supporting the 
replication of HCV. 

In particular, it is contemplated that the 
15 mutations of the infectious nucleic acid sequence of the 
invention and the production of chimeric sequences as 
discussed above may be useful in identifying sequences 
critical for cell culture adaptation of HCV and hence, 
may be useful in identifying cell lines capable of 

20 

supporting HCV replication. 

Transfection of tissue culture cells with the 
nucleic acid sequences of the invention may be done by 
methods of transfection known in the art such as 
25 electroporation, precipitation with DEAE-Dextran or 
calcium phosphate or liposomes. 

In one such embodiment, the method comprises 

the growing of animal cells, especially human cells, in 

vitro and transfecting the cells with the nucleic acid 
30 

of the invention, then determining if the cells show 
indicia of HCV infection. Such indicia include the 
detection of viral antigens in the cell, for example, by 
immunofluorescence procedures well known in the art; the 
35 detection of viral polypeptides by Western blotting 
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using antibodies specific therefor; and the detection of 
newly transcribed viral RNA within the cells via methods 
such as RT-PCR. The presence of live, infectious virus 
particles following such tests may also be shown by 
5 injection of cell culture medium or cell lysates into 

healthy, susceptible animals, with subsequent exhibition 
of the signs and symptoms of>HCV infection. 

C <. ~ 4 .. "L~. "1 1 T ^ 1 1 1 J «C T J T T ^-»t » 

include, but are not limited to, lymphocyte and 
hepatocyte cell lines known in the art . 

Alternatively, primary hepatocytes can be 
cultured, and then infected with HCV; or, the hepatocyte 
cultures could be derived from the livers of infected 
chimpanzees. In addition, various immortalization 
methods known to those of ordinary skill in the art can 
be used to obtain cell lines derived from hepatocyte 
cultures. For example, primary hepatocyte cultures may 
be fused to a variety of cells to maintain stability. 

The present invention further relates to the 
in vitro and in vivo production of hepatitis C viruses 
from the nucleic acid sequences of the invention. 

In one embodiment, the sequences of the 
invention can be inserted into an expression vector that 
functions in eukaryotic cells. Eukaryotic expression 
vectors suitable for producing high efficiency gene 
transfer in vivo are well known to those of ordinary 
skill in the art and include, but are not limited to, 
plasmids, vaccinia viruses, retroviruses, adenoviruses 
and adeno-associated viruses. 

In another embodiment, the sequences contained 
in the recombinant expression vector can be transcribed 
in vitro by methods known to those of ordinary skill in 
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the art in order to produce RNA transcripts which encode 
the hepatitis C viruses of the invention. The hepatitis 
C viruses of the invention may then be produced by 
transfecting cells by methods known to those of ordinary 
5 skill in the art with either the in vitro transcription 
mixture containing the RNA transcripts or with the 
recombinant expression vectors containing the nucleic 
acid sequences described herein. 

The hepatitis C viruses produced from the 

^ sequences of the invention may be purified or partially 
purified from the transfected cells by methods known to 
those of ordinary skill in the art. In a preferred 
embodiment, the viruses are partially purified prior to 

15 their use as immunogens in the pharmaceutical 

compositions and vaccines of the present invention. 

The present invention therefore relates to the 
use of the hepatitis C viruses produced from the nucleic 
acid sequences of the invention as immunogens in live or 

20 

killed ( e.g. , formalin inactivated) vaccines' to prevent 
hepatitis C in a mammal. 

In an alternative embodiment, the immunogen of 
the present invention may be an infectious nucleic acid 
25 sequence, a chimeric nucleic acid sequence, or a mutated 
infectious nucleic acid sequence which encodes a 
hepatitis C virus. Where the sequence is a cDNA 
sequence, the cDNAs and their RNA transcripts may be 
used to transfect a mammal by direct injection into the 

30 

liver tissue of the mammal as described in the Examples. 

Alternatively, direct gene transfer may be 
accomplished via administration of a eukaryotic 
expression vector containing a nucleic acid sequence of 
35 the invention . 
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In yet another embodiment, the immunogen may 
be a polypeptide encoded by the nucleic acid sequences 
of the invention. The present invention therefore also 
relates to polypeptides produced from the nucleic acid 
5 sequences of the invention or fragments thereof- In one 

embodiment, polypeptides of the present invention can be 
recombinantly produced by synthesis from the nucleic 
acid sequences of the invention or isolated fragments 
thereof, and purified, or partially purified, from 
transfected cells using methods already known in the 
art. In an alternative embodiment, the polypeptides may 
be purified or partially purified from viral particles 
produced via transfection of a host cell with the 
nucleic acid sequences of the invention. Such 
polypeptides might, for example, include either capsid 
or envelope polypeptides prepared from the sequences of 
the present invention. 

When used as immunogens, the nucleic acid 
sequences of the invention, or the polypeptides or 
viruses produced therefrom, are preferably partially 
purified prior to use as immunogens in pharmaceutical 
compositions and vaccines of the present invention. 
When used as a vaccine, the sequences and the 
polypeptide and virus products thereof, can be 
administered alone or in a suitable diluent, including, 
but not limited to, water, saline, or some type of 
buffered medium. The vaccine according to the present 
invention may be administered to an animal, especially a 
mammal, and most especially a human, by a variety of 
routes, including, but not limited to, intradermally, 
intramuscularly, subcutaneously, or in any combination 
thereof . 
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Suitable amounts of material to administer for 
prophylactic and therapeutic purposes will vary 
depending on the route selected and the immunogen 
(nucleic acid, virus, polypeptide) administered. One 
skilled in the art will appreciate that the amounts to 
be administered for any particular treatment protocol 
can be readily determined wijthout undue experimentation. 
The vaccines of the present invention may be 
administered once or periodically until a suitable titer 
of anti-HCV antibodies appear in the blood. For an 
immunogen consisting of a nucleic acid sequence, a 
suitable amount of nucleic acid sequence to be used for 
prophylactic purposes might be expected to fall in the 
15 range of from about 100 |ig to about 5 mg and most 

preferably in the range of from about 500 jig to about 
2mg. For a polypeptide, a suitable amount to use for 
prophylactic purposes is preferably 100 ng to 100 \xg and 
20 a virus 10 2 to 10 6 infectious doses. Such 

administration will, of course, occur prior to any sign 
of HCV infection. 

A vaccine of the present invention may be 
employed in such forms as capsules, liquid solutions, 
suspensions or elixirs for oral administration, or 
sterile liquid forms such as solutions or suspensions. 
An inert carrier is preferably used, such as saline or 
phosphate-buffered saline, or any such carrier in which 
30 the HCV of the present invention can be suitably 

suspended. The vaccines may be in the form of single 
dose preparations or in multi-dose flasks which can be 
utilized for mass-vaccination programs of both animals 
and humans. For purposes of using the vaccines of the 
present invention reference is made to Remington f s 
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Pharmaceutical Sciences , Mack Publishing Co., Easton, 
Pa., Osol (Ed.) (1980); and New Trends and Developments 
in Vaccines , Voller et al. (Eds.), University Park 
Press, Baltimore, Md. (1978), both of which provide much 
5 useful information for preparing and using vaccines. Of 

course, the polypeptides of the present invention, when 
used as vaccines, can include, as part of the 

aa*«vva.-,-:-4--.' a vs Av MM .<.1 n ^ A « — , , J 4 l_ "I 1 _I J- 1_ 
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alum (or aluminum hydroxide) when humans are to be 
vaccinated, to further stimulate production of 
antibodies by immune cells. When nucleic acids, viruses 
or polypeptides are used for vaccination purposes, other 
specific adjuvants such as CpG motifs (Krieg, A.K. et 
al.(1995) and (1996)), may prove useful. 

When the nucleic acids, viruses and 
polypeptides of the present invention are used as 
vaccines or inocula, they will normally exist as 
physically discrete units suitable as a unitary dosage 
for animals, especially mammals, and most especially 
humans, wherein each unit will contain a predetermined 
quantity of active material calculated to produce the 
desired immunogenic effect in association with the 
required diluent. The dose of said vaccine or inoculum 
according to the present invention is administered at 
least once. In order to increase the antibody level, a 
second or booster dose may be administered at some time 
after the initial dose. The need for, and timing of, 
such booster dose will, of course, be determined within 
the sound judgment of the administrator of such vaccine 
or inoculum and according to sound principles well known 
in the art. For example, such booster dose could 
reasonably be expected to be advantageous at some time 



WO 00/75338 



PCTAJS00/15446 



- 21 - 

between about 2 weeks to about 6 months following the 
initial vaccination. Subsequent closes may be 
administered as indicated. 

The nucleic acid sequences, viruses and 

5 polypeptides of the present invention can also be 

administered for purposes of therapy, where a mammal, 
especially a primate, and most especially a human, is 
already infected, as shown by well known diagnostic 
measures. When the nucleic acid sequences, viruses or 

10 polypeptides of the present invention are used for such 
therapeutic purposes, much of the same criteria will 
apply as when it is used as a vaccine, except that 
inoculation will occur post-infection. Thus, when the 

15 nucleic acid sequences, viruses or polypeptides of the 
present invention are used as therapeutic agents in the 
treatment of infection, the therapeutic agent comprises 
a pharmaceutical composition containing a sufficient 
amount of said nucleic acid sequences, viruses or 

20 

polypeptides so as to elicit a therapeutically effective 
response in the organism to be treated. Of course, the 
amount of pharmaceutical composition to be administered 
will, as for vaccines, vary depending on the immunogen 
25 contained therein (nucleic acid, polypeptide, virus) and 
on the route of administration. 

The therapeutic agent according to the present 
invention can thus be administered by subcutaneous, 
intramuscular or intradermal routes. One skilled in the 

30 

art will certainly appreciate that the amounts to be 
administered for any particular treatment protocol can 
be readily determined without undue experimentation. Of 
course, the actual amounts will vary depending on the 
35 route of administration as well as the sex, age, and 
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clinical status of the subject which, in the case of 
human patients, is to be determined with the sound 
judgment of the clinician. 

The therapeutic agent of the present invention 
5 can be employed in such forms as capsules, liquid 

solutions, suspensions or elixirs, or sterile liquid 
forms such as solutions or §iaspensions . An inert carrier 
is preferably used, such as saline, phosphate-buffered 
saline, or any such carrier in which the HCV of the 
present invention can be suitably suspended. The 
therapeutic agents may be in the form of single dose 
preparations or in the multi-dose flasks which can be 
utilized for mass-treatment programs of both animals and 
humans. Of course, when the nucleic acid sequences, 
viruses or polypeptides of the present invention are 
used as therapeutic agents they may be administered as a 
single dose or as a series of doses, depending on the 
situation as determined by the person conducting the 
treatment . 

The nucleic acids, polypeptides and viruses of 
the present invention can also be utilized in the 
production of antibodies against HCV. The term 
"antibody" is herein used to refer to immunoglobulin 
molecules and immunologically active portions of 
immunoglobulin molecules. Examples of antibody 
molecules are intact immunoglobulin molecules, 
substantially intact immunoglobulin molecules and 
portions of an immunoglobulin molecule, including those 
portions known in the art as Fab, F(ab') 2 and F(v) as 
well as chimeric antibody molecules. 

Thus, the polypeptides, viruses and nucleic 
acid sequences of the present invention can be used in 
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the generation of antibodies that immunoreact (i.e., 
specific binding between an antigenic determinant- 
containing molecule and a molecule containing an 
antibody combining site such as a whole antibody 
molecule or an active portion thereof) with antigenic 
determinants on the surface of hepatitis C virus 
particles . ^ 

The present invention therefore also relates 
to antibodies produced following immunization with the 
nucleic acid sequences , viruses or polypeptides of the 
present invention. These antibodies are typically 
produced by immunizing a mammal with an immunogen or 
vaccine to induce antibody molecules having 
15 immunospecif icity for polypeptides or viruses produced 

in response to infection with the nucleic acid sequences 
of the present invention . When used in generating such 
antibodies, the nucleic acid sequences, viruses, or 
polypeptides of the present invention may be linked to 
some type of carrier molecule. The resulting antibody 
molecules are then collected from said mammal. 
Antibodies produced according to the present invention 
have the unique advantage of being generated in response 
25 to authentic, functional polypeptides produced according 
to the actual cloned HCV genome. 

The antibody molecules of the present 
invention may be polyclonal or monoclonal. Monoclonal 
antibodies are readily produced by methods well known in 
the art . Portions of immunoglobin molecules, such as 
Fabs, as well as chimeric antibodies, may also be 
produced by methods well known to those of ordinary 
skill in the art of generating such antibodies . 
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The antibodies according to the present 
invention may also be contained in blood, plasma, serum, 
hybridoma supernatants, and the like. Alternatively, 
the antibody of the present invention is isolated to the 
5 extent desired by well known techniques such as, for 

example, using DEAE Sephadex. The antibodies produced 
according to the present invention may be further 
purified so as to obtain specific classes or subclasses 
of antibody such as IgM, IgG, IgA, and the like. 
Antibodies of the IgG class are preferred for purposes 
of passive protection. 

The antibodies of the present invention are 
useful in the prevention and treatment of diseases 
caused by hepatitis C virus in animals, especially 
mammals, and most especially humans. 

In providing the antibodies of the present 
invention to a recipient mammal, preferably a human, the 
dosage of administered antibodies will vary depending on 
such factors as the mammal's age, weight, height, sex, 
general medical condition, previous medical history, and 
the like. 

In general, it will be advantageous to provide 
the recipient mammal with a dosage of antibodies in the 
range of from about 1 mg/kg body weight to about 10 
mg/kg body weight of the mammal, although a lower or 
higher dose may be administered if found desirable. 
Such antibodies will normally be administered by 
intravenous or intramuscular route as an inoculum. The 
antibodies of the present invention are intended to be 
provided to the recipient subject in an amount 
sufficient to prevent, lessen or attenuate the severity, 
extent or duration of any existing infection. 
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The antibodies prepared by use of the nucleic 
acid sequences, viruses or polypeptides of the present 
invention are also highly useful for diagnostic 
purposes. For example, the antibodies can be used as in 
vitro diagnostic agents to test for the presence of HCV 
in biological samples taken from animals, especially 
humans. Such assays include^, but are not limited to, 
radioimmunoassays, EIA, fluorescence, Western blot 
analysis and ELISAs. In one such embodiment, the 
biological sample is contacted with antibodies of the 
present invention and a labeled second antibody is used 
to detect the presence of HCV to which the antibodies 
are bound. 

15 Such assays may be, for example, direct where 

the labeled first antibody is immunoreactive with the 
antigen, such as, for example, a polypeptide on the 
surface of the virus; indirect where a labeled second 
antibody is reactive with the first antibody; a 
competitive protocol such as would involve the addition 
of a labeled antigen; or sandwich where both labeled and 
unlabeled antibody are used, as well as other protocols 
well known and described in the art. 
25 In one embodiment, an immunoassay method would 

utilize an antibody specific for HCV envelope 
determinants and would further comprise the steps of 
contacting a biological sample with the HCV-specific 
antibody and then detecting the presence of HCV material 
in the test sample using one of the types of assay 
protocols as described above. Polypeptides and 
antibodies produced according to the present invention 
may also be supplied in the form of a kit, either 
35 present in vials as purified material, or present in 
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compositions and suspended in suitable diluents as 
previously described . 

In a preferred embodiment, such a diagnostic 
test kit for detection of HCV antigens in a test sample 
comprises in combination a series of containers, each 
container a reagent needed for such assay. Thus, one 
such container would contain^ a specific amount of HCV- 
specific antibody as already described, a second 
container would contain a diluent for suspension of the 
sample to be tested, a third container would contain a 
positive control and an additional container would 
contain a negative control. An additional container 
could contain a blank. 

For all prophylactic, therapeutic and 
diagnostic uses, the antibodies of the invention and 
other reagents, plus appropriate devices and 
accessories, may be provided in the form of a kit so as 
to facilitate ready availability and ease of use. 

The present invention also relates* to the use 
of nucleic acid sequences and polypeptides of the 
present invention to screen potential antiviral agents 
for antiviral activity against HCV. Such screening 
methods are known by those of skill in the art. 
Generally, the antiviral agents are tested at a variety 
of concentrations, for their effect on preventing viral 
replication in cell culture systems which support viral 
replication, and then for an inhibition of infectivity 
or of viral pathogenicity (and a low level of toxicity) 
in an animal model system. 

In one embodiment, animal cells (especially 
human cells) transfected with the nucleic acid sequences 
of the invention are cultured in vitro and the cells are 
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treated with a candidate antiviral agent (a chemical, 
peptide etc.) by adding the candidate agent to the 
medium. The treated cells are then exposed, possibly 
under transfecting or fusing conditions known in the 
art, to the nucleic acid sequences of the present 
invention. A sufficient period of time would then be 
allowed to pass for infection to occur, following which 
the presence or absence of viral replication would be 
determined versus untreated control cells by methods 
known to those of ordinary skill in the art. Such 
methods include, but are not limited to, the detection 
of viral antigens in the cell, for example, by 
immunofluorescence procedures well known in the art; the 
15 detection of viral polypeptides by Western blotting 
using antibodies specific therefor; the detection of 
newly transcribed viral RNA within the cells by RT-PCR; 
and the detection of the presence of live, infectious 
virus particles by injection of cell culture medium or 
cell lysates into healthy, susceptible animals, with 
subsequent exhibition of the signs and symptoms of HCV 
infection. A comparison of results obtained for control 
cells (treated only with nucleic acid sequence) with 
25 those obtained for treated cells (nucleic acid sequence 
and antiviral agent) would indicate, the degree, if any, 
of antiviral activity of the candidate antiviral agent. 
Of course, one of ordinary skill in the art would 
readily understand that such cells can be treated with 
the candidate antiviral agent either before or after 
exposure to the nucleic acid sequence of the present 
invention so as to determine what stage, or stages, of 
viral infection and replication said agent is effective 
35 against. 
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In an alternative embodiment, viral enzyme 
such as NS3 protease, NS2-NS3 protease, NS3 helicase or 
NS5B RNA polymerase may be produced from a nucleic acid 
sequence of the invention and used to screen for 
inhibitors which may act as antiviral agents. The 
structural and nonstructural regions of the HCV genome, 
including nucleotide and amipo acid locations, have been 
determined, for example, as depicted in Houghton, M. 
(1996), Fig. 1; and Major, M.E. et al. (1997), Table 2. 

Such above-mentioned protease inhibitors may 
take the form of chemical compounds or peptides which 
mimic the known cleavage sites of the protease and may 
be screened using methods known to those of skill in the 
art (Houghton, M. (1996) and Major, M.E. et al. (1997)). 
For example, a substrate may be employed which mimics 
the protease 1 s natural substrate, but which provides a 
detectable signal (e.g. by fluorimetric or colorimetric 
methods) when cleaved. This substrate is then incubated 
with the protease and the candidate protease inhibitor 
under conditions of suitable pH, temperature etc. to 
detect protease activity. The proteolytic activities of 
the protease in the presence or absence of the candidate 
inhibitor are then determined. 

In yet another embodiment, a candidate 
antiviral agent (such as a protease inhibitor) may be 
directly assayed _in vivo for antiviral activity by 
administering the candidate antiviral agent to a 
chimpanzee transfected with a nucleic acid sequence of 
the invention or infected with a virus of the invention 
and then measuring viral replication iri vivo via methods 
such as RT-PCR. Of course, the chimpanzee may be 
treated with the candidate agent either before or after 
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transfection with the infectious nucleic acid sequence 
or infected with a virus of the invention so as to 
determine what stage, or stages, of viral infection and 
replication the agent is effective against. 

The invention also provides that the nucleic 
acid sequences, viruses and polypeptides of the 
invention may be supplied in^the form of a kit, alone or 
in the form of a pharmaceutical composition. 

All scientific publication and/or patents 
cited herein are specifically incorporated by reference. 
The following examples illustrate various aspects of the 
invention but are in no way intended to limit the scope 
thereof . 

EXAMPLES 
Materials and Methods 



Source of HCV 

An infectious plasma pool of HCV genotype 2a 
(HC-J6 C h) prepared from acute phase plasma of a 
chimpanzee experimentally inoculated with plasma from a 
Japanese patient infected with strain HC-J6 (Okamoto et 
al., 1991) was used for cloning. An infectious cDNA 
25 clone of HCV strain H77, genotype la was also used 
(pCV-H77C; Yanagi et al., 1997). 
Amplification, cloning and sequence analysis 

Viral RNA was extracted from 100 ^il aliquots 
30 of the HC-J6 CH plasma pool with the TRIzol system 

(GIBCO/BRL) (Yanagi et al., 1997). Primers used in cDNA 
synthesis and PCR amplification were based on the 
genomic sequence of strain HC-J6 (Okamoto et al,, 1991) 
and from the conserved region (3'X) of the 3' UTR of HCV 
genotype 2a (Tanaka et al., 1996) (Table 1). The RNA 
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was denatured at 65°C for 2 min, and cDNA was 
synthesized at 42°C for 1 hour with Superscript II 
reverse transcriptase (GIBCO/BRL) and specific reverse 
primers in 20 \xl reaction volumes. The cDNA mixtures 
were treated with RNase H and RNase Tl (GIBCO/BRL) at 
37°C for 20 min. 
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TABLE 1 

Oligonucleotides used for amplification and cloning 
of strain HC-J6 C h, genotype 2a 



Designation 



2427S-H77 

2426S-H77 

2645R-H77 

2832R-H77 

H2751SII 

H2786R 

H2870R 

H7851S 

H9140S {M) 

H9173R (M) 

H9471R 

J6-H2556S 

356RF-J6H 

333S-J6 

753R-J6 

2543S-J6F 

2787R-J6(26) 

3329R-J6 

5487-J6F 

5518R-J6F 

9251S-J6F 

9305R-J6F 

9310R-J6F 

9399S-J6F 

9464-J6F 

9470(24)-J6 

J6-3' XR 



Sequence (5' 



3' )a 



ACTGGACACGGAGGTGGCCGCGTC 
TTGTTCTTGTCGGGTTAATGGCGC 
GGGTGTACTACACACATGAGTAAG 
AAGCGCCCCTAACTGATGATG 

CGT CATCGATA CCTCAGCGGG CATATG CACTGGACACGGA 

GTCCAGTG CATATG CCCGCTGAGG 

C AT GC AC CAGCTGAT AT AG CGCT TGTAAT ATG 

TCCGTAGAGG AAGCTT GCAGCCTGACGCCC 

CAGAGGAGGCAGGGTGCTATATGTGGCAAGTAC 

GTACTTGCCACATATAGCAGCCCTGCCTCCTCTG 

CGT CTCTAGA CAGGAAATGG CTTAAG AGGCCGGAGTGTTTACC 

TTATG GATGCTCATC TTGTTGGGCCAGGCCGAAGCAGCTTTGGAGAACCTCGTAATACT 
CAATGC 

AGG AT TT GT GCTC AT GGTG CACG GT CT ACG AG 
' inn m rVVr r GCGGCCGG TAAI&CGACTCACm 

CCGTGCACCATGAGCACAAATCCTAAACCTC 

GGATGTACCCCATGAGGTCGGCAAAG 

GTTTGCGCCTGCTTAT GGATGCTCATC TTG 

GCGTCATAA GCATATG CCTGTTGGGG 

CCCTCAGCACTGGAGTACATCTG 

CGTCATGCATACCCCTAGGGCGGCTCTCATTGAAGAGGG 

CGTCCCCTCTTCAATGAGAGCCG CTCTAGA 

GCGGTGAAGACCAAGCTCAAACTCACTC 

AATCTASAAGGCGCGCTTCCGGCAATGGAGTGAGTTTGAGC 

CGT CTCTAGA GGATAAATCCAGGAGGCGCGCTTCCGGC 

TACTTTTTGTAGGGGTAGGCCTTTTCC 

CGT CTCTAGA GTGTAGCTAATGTGTGCCGCTCTA 

CTATGGAGTGTAGCTAATGTGTGC 

CGT CTCTAGA CATGATCTGCAGAGAGACCAGTTACGGCACTCTCTGFCAGTCATGCGGC 
TCACGGACCTTTCACAGCTAGCCGTGACTAGGGCTAAGATGGAGCCACC 



HCV-specific sequences are shown in plain text, non HCV-specific 
sequences are shown in bold face, and cleavage sites used for cDNA 
cloning are underlined. 

The core sequence of the T7 promotor is shown in italics. 



35 



The strategy used to amplify and clone the 
full-length HC-J6 C h sequence is shown in Fig. 1. 
Nucleotide positions correspond to those of the 2a 
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infectious clone (pJ6CF) that is described herein. The 
5' end of HC-J6 C h (nts. 17-297, excluding primer 
sequences) was amplified from 2 |il of cDNA synthesized 
with primer a-2 (Yanagi et al., 1996). PCR was performed 
5 with AmpliTaq Gold DNA polymerase (Perkin-Elmer) as 

described previously (Yanagi et aJ. f 1996) using primers 
1S-J6F and a-2. After purification, the amplified 
products were cloned into pGEM-T Easy vector (Promega) 
using standard procedures and 5 clones (pJ6-5'UTR) were 

10 

sequenced. 

The 3' end of HC-J6 CH was amplified in 3 
overlapping pieces. RT-PCR of a short fragment of NS5B 
(nts. 9279-9439) was performed with primers 9251S-J6F 

15 and 9464R-J6F as described above. The PCR products were 
cloned into pGEM-T Easy vector and sequence analysis was 
performed from 5 pJ6-3'F clones. A second region 
spanning from NS5B to the conserved region of the 3' UTR 
(nts. 9376-9629) was amplified in RT-nested PCR 
(external primers H9261F and H3'X58R, internal primers 
H9282F and H3'X45R) . (Yanagi et al., 1997). The amplified 
products were cloned into pGEM-9zf (-) by using Hindi 1 1 
and Xbal sites and 14 pJ6-3'VR clones were sequenced. 

25 The third fragment, which included the 3' terminal 
sequence was amplified with primers 9399S-J6F and 
J6-3'XR from one of the pJ6-3'VR clones, and cloned into 
one of the pJ6-3'F clones by using StuI and Xbal sites 

3Q (pJ6-3»X) . 

The ORF of HCV HC-J6 CH was amplified by long 
RT-PCR in 3 overlapping pieces. The amplification was 
performed on 2 nl of the cDNA mixtures with the 
Advantage cDNA polymerase mix (Clontech) (Yanagi et al., 
35 1997) . The J6S fragment (nts. 86-2761) was amplified 
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with primers a-1 (Yanagi et al., 1996) and J6-2787R from 
cDNA synthesized with primer J6-3329R. A single PCR 
round was performed in a Robocycler thermal cycler 
(Stratagene) , and consisted of denaturation at 99°C for 
5 35 sec, annealing at 67°C for 30 sec and elongation at 

68°C for 4 min 30 sec during the first 5 cycles, 5 min 
during the next 10 cycles, 5" min 30 sec during the 
following 10 cycles and 6 min during the last 10 cycles, 
10 The J6B fragment (nts. 2573-5488) was amplified with 

primers 2543S-J6F and 5518R-J6F from cDNA synthesized 
with primer 5518R-J6F. Finally, the J6A fragment (nts. 
5515-9282) was amplified with primers 5487S-J6F and 
9310R-J6F from cDNA synthesized with primer 
^ 9470R (24 ) - J6F. PCR amplifications of fragments J6B and 

J6A consisted of denaturation at 99°C for 35 sec, 
annealing at 67°C for 30 sec and elongation at 68°C for 6 
min during the first 5 cycles, 7 min during the next 10 
20 cycles, 8 min during the following 10 cycles. and 9 min 

during the last 10 cycles. 

After purification of the long PCR products 
with QIAquick PCR purification kit (QIAGEN) , A-tailing 
reactions were performed with AmpliTaq DNA polymerase 
(Perkin Elmer) at 72 °C for 1 hour. The gel-purified 
A-tailed PCR products were cloned into pCR2 . 1 vector 
(Invitrogen) or pGEM-T Easy vector (Promega) . DH5-alpha 
competent cells (GIBCO BRL) were transformed and 
selected on LB agar plates containing 100 ng/ml 
ampicillin (SIGMA) and amplified in LB liquid cultures 
at 30°C for 18 - 20 hrs (Yanagi et al., 1997). Midiprep 
was performed using Wizard Plus Midipreps DNA 
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Purification System (Promega) . Multiple clones of the 

o 

J6S, J6A and the J6B fragments were sequenced. 

The consensus sequence of strain HOJ6 C h (nts. 
17-9629) was determined by direct sequencing of PCR 
products (nts. 297-3004 and nts. 4893-5762) and by 
5 sequence analysis of the TA clones (nts. 17-5488 and 
nts. 5515-9629) (Fig. 1). Both strands of DNA were 
sequenced in all cases. Analyses of genomic sequences, 
including multiple sequence alignments and tree 
analyses, were performed with GeneWorks (Oxford 

10 

Molecular Group) (Bukh et al., 1995). 

Construction of chimeric cDNA clones of genotypes la & 
2a 

Four full-length intertypic chimeric cDNA 
15 clones were constructed (Figs. 4, 5A, 5B) . In each clone 
the C, El and E2 genes encoded the consensus amino acid 
sequence of HC-J6 C h- The p7 protein was encoded either by 
the HC-J6 C h or pCV-H77C consensus sequence, and the NS 
20 proteins were all encoded by pCV-H77C genes. To 

engineer these cDNA clones, an Ndel site from pCV-H77C 
was first eliminated by a silent substitution (C to T) 
at position 9158. In brief, two fragments were 
amplified from pCV-H77C with primers H78 51S and 

25 

H9173R(M) and with primers H9140S(M) and H9417R (Table 
3) , gel-purified and used for fusion PCR with primers 
H7851S and H9417R. The fusion PCR products were cloned 
into pCV-H77C by using Hindlll and Aflll sites. A new 
30 artificial Ndel site was introduced by a silent 

substitution (C to T) at position 2765. PCR products, 
which were amplified from pCV-H77C with primer H2751SII 
containing artificial Clal and Ndel sites and primer 
H2870R, were cloned into the modified pCV-H77C by using 
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Clal and Eco47III sites. The final construct (pH77CV) 
was used as a cassette vector to construct the 
intertypic chimeric HCV cDNA clones. 

The four chimeric cDNA clones were constructed 
5 as follows, pH77CV-J6S (nucleotide sequence shown in 

SEQ ID No: 3 and amino acid sequence shown in SEQ ID 
No:4): The Agel/BsmI fragment of clone J6S2 and the 
Bsml/ Ndel fragment of clone J6S1, were cloned into 
pH77CV by using Agel and Ndel sites; pH77 (p7)CV-J6S 
(nucleotide sequence shown in SEQ ID No: 5 and amino acid 
sequence shown in SEQ ID No: 6): A fragment of pH77CV-J6S 
was replaced with a fragment amplified from pCV-H77C 
with primers J6-H2556S and H2786R by using BsaBI and 
Ndel sites; J6S (nucleotide sequence shown in SEQ ID 
No:7 and amino acid sequence shown in SEQ ID No:8): A 
fragment amplified from pH77pCV-H77C with primers a-1 
and 356RF-J6H77 and another fragment amplified from 
pH77CV-J6S with primers 333S-J6 and 753R-J6 were 
gel-purified and a fusion-PCR was performed with primers 
a-1 and 753R-J6. The Agel/ Clal fragment of the 
subcloned fusion PCR products and the Clal/Ndel fragment 
of pH77CV-J6S were cloned into pH77CV-J6S by using ^gel 
and Ndel sites; pH77(p7)-J6S (nucleotide sequence shown 
in SEQ ID No: 9 and amino acid sequence shown in SEQ ID 
No: 10): The Agel/Clal fragment of J6S and the Clal/Ndel 
fragment of (p7)CV-J6S were cloned into pH77 (p7 ) CV- J6S 
by using Agel and Ndel sites. 

Each intertypic chimeric cDNA clone was 
retransf ormed to select a single clone, and large-scale 
preparation of plasmid DNA was performed with a QIAGEN 
plasmid Maxi kit as described previously (Yanagi et al., 
1997). Each of the four cDNA clones was completely 
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sequenced before inoculation. Each clone was 
genetically stable since the digestion pattern was as 
expected following retransf ormation and the complete 
sequence was the expected one. 

Construction of full-length cDNA clone HC-J6 C h 

An overview of the full-length HC-J6 C h clone is 
presented in Fig, 1. In the^final construct pJ6CF, 
which encodes the consensus polyprotein of HC-J6 C Hf an 
Xbal site was eliminated by a silent substitution (A to 
G) at position 5494, Digested fragments containing the 
consensus sequence were purified from the appropriate 
subclones and ligated using the sites indicated- The 
full-length cDNA clone (pJ6CF) was retransf ormed to 
select a single clone, and large-scale preparation of 
plasmid DNA followed by the complete sequence analysis 
was performed. Clone pJ6CF was genetically stable. 

Intrahepatic transfection of chimpanzee with transcribed 
20 RNA 

In duplicate 100 \xl reactions, RNA was 
transcribed in vitro with T7 RNA polymerase (Promega) 
from 10 |j.g of template plasmid linearized with Xbal 
(Promega) as described previously (Yanagi et al., 1997). 
The integrity of the RNA was checked by electrophoresis 
through agarose gel stained with ethidium bromide 
(Yanagi et al., 1997). Each transcription mixture was 
diluted with 400 y.1 of ice-cold phosphate-buffered 
saline without calcium or magnesium and then immediately 
frozen on dry ice and stored at -80°C. Within 24 hours, 
both transcription mixtures were injected into the same 
chimpanzee by percutaneous intrahepatic injection guided 
35 by ultrasound (Yanagi et al., 1998, 1999). If the 
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chimpanzee did not become infected, the same 
transfection was repeated once. After two negative 
results, the next clone was inoculated into the same 
chimpanzee following the same protocol. Injections were 
performed at weeks 0 and 2 with pH77CV-J6S, at weeks 5 
and 8 with pH77 (p7)CV-J6S, at weeks 14 and 16 with 
pH77-J6S, at weeks 19 and 23. with pH77 (p7 ) - J6S, at week 
28 with . pJGCF, and finally at week 34 with pCV K77C. 
The chimpanzee was maintained under conditions that met 
or exceeded all requirements for its use in an approved 
facility. 

Serum samples were collected weekly from, the 
chimpanzee and monitored for liver enzyme levels by 
standard procedures, anti-HCV antibodies by the 
second-generation ELISA (Abbott) and HCV RNA by a 
sensitive RT-nested PCR assay with AmpliTaq Gold DNA 
polymerase using primers from the 5' UTR (Yanagi et al., 
1996) . Samples were scored as negative for HCV RNA if 
two independent tests on 100 \xl of serum were negative. 
The genome equivalent (GE) titer of HCV in positive 
samples was determined by RT-nested PCR on 10-fold 
serial dilutions of the extracted RNA (Bukh et al., 
1998). The consensus sequence of the complete ORF from 
the chimpanzee infected with RNA transcripts of pJ6CF 
was determined by direct sequencing of overlapping PCR 
products obtained by long RT-nested PCR as previously 
described (Yanagi et al., 1997) with HC-J6 specific 
primers. After the intrahepatic transfection with RNA 
transcripts of pCV-H77C, we performed H7 7 (genotype la)- 
specific RT-nested PCR with primers 2427S-H77 and 
2832R-H77 for the 1st round and with primers 2462S-H77 
and 2645R-H77 for the 2nd round (Table 3) . The 
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sensitivity of this assay was equivalent to that of the 
assay using 5' UTR primers when testing serum containing 
only H77, genotype la. The genome titer of genotype la 
was determined by using this specific RT- nested PCR on 
10-fold serial dilutions of the extracted RNA. 

EXAMPLE 1 

Sequence analysis of HCV strain HC-J6 C h 

As minor deviations from the consensus amino 
acid sequence were found previously to render 
full-length HCV cDNA clones noninfectious (Yanagi et 
al., 1997, 1998), the consensus sequence of the cloning 
source of genotype 2a (strain HC-J6 CH ) was determined 
prior to constructing any full-length clones . In brief, 
a plasma pool containing strain HC-J6 C h was prepared from 
acute phase plasmapheresis units collected from a 
chimpanzee experimentally infected with HC-J6 (Okamoto 
et al., 1991). The HCV genome titer of this pool was 
10 5 - 4 genome equivalents (GE) /ml (Quantiplex HCV RNA 
bDNA 2.0, Chiron) and the infectivity titer was 10 4 
chimpanzee infectious doses/ml . 

The consensus sequence of the 5 ? UTR of HC-J6 C h 
(nts. 17-340) was deduced from 5 clones containing nts. 
17-297 and 8 clones containing nts. 86-340. The 5 f UTR 
of the various clones was highly conserved, but the 
consensus sequence of HC-J6 C h differed by 2 nucleotides 
from that published previously for HC-J6 (Okamoto et 
al., 1991: C to T at position 36 and T to C at position 
222) . 

The consensus sequence of 14 clones of the 3 ' 
UTR of HC-J6 CH indicated that the 39 nucleotide long 
variable region was highly conserved in this strain and 
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was identical to that previously published for HC-J6 
(Okamoto et ai., 1991). The polypyrimidine tract varied 
greatly in length (84-164 nucleotides) , and contained 
some conserved A residues. In the conserved region, the 
proximal 16 nucleotides were identical to those 
previously published for isolates of different HCV 
genotypes (Kolykhalov et ai.^ 1996; Tanaka et al., 1996; 
Yair.ada ct ai., 1996) . The remaining 82 nucleotides of 
the conserved region were determined for other genotype 
2a strains (Tanaka et al., 1996) but not for HC-J6 or 
HC— J6ch * 

The ORF of HC-J6 C h was amplified in 3 fragments 
by RT-PCR (Fig. 1) . Eight clones of the J6S fragment 
15 (nts. 86-2761), 6 clones of the J6B fragment (nts. 

2573-5488) and 6 clones of the J6A fragment (nts. 
5515-9298) were sequenced. PCR fragments containing 
nts. 5489-5514 were sequenced directly. A quasispecies 
was found at 243 nucleotide (2.7%) and 69 amino acid 
(2.3%) positions, scattered throughout the 9099 nts 
(3033 aa) of the ORF. However, the majority, 231 
nucleotide substitutions, were detected only once and 
71.6 % of these represented silent mutations. The 12 
25 remaining nucleotide substitutions were each restricted 
to 2 clones and only 4 of these resulted in amino acid 
changes. The nucleotide difference among the J6S clones 
ranged from 0.1 - 1.3%, among the J6B clones it ranged 
from 0.1 - 0.3%, and it ranged from 0.2 - 4.0% among the 
J6A clones (Fig. 2) . Three of 8 J6S clones, 4 of 6 J6B 
clones, and all 6 J6A clones had defective polyproteins 
due to nucleotide deletions, insertions or 
substitutions . 
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The sequences of clones of strain HC-J6 C h were 



relatively homogeneous. This was highlighted by the 
high degree of conservation among clones of the HVRl 
(Fig. 3) , a region frequently used to study the 
5 quasispecies of HCV (Bukh et al., 1995). An exception 
was the sequence of clone J6A1, which differed by about 
4% from the other clones of J^his region (Fig. 2) . 
Importantly, the consensus sequence of strain HC- J6 C h 
(nts. 17-9629) could be determined with no ambiguity at 
the nucleotide or deduced amino acid level . The 
difference between the consensus ORF sequence of HC-J6 C h 
from the experimentally infected chimpanzee and that of 
HC-J6 of the inoculum (Okamoto et al., 1991) was 4.1 % 
15 and 2.2 % at the nucleotide and deduced amino acid 

levels, respectively (Fig. 2, Table 2). Moreover, we 
found that 12 (44.4%) of the 27 amino acids constituting 
HVRl differed between HC-J6 CH and HC-J6 (Fig. 3) . Such 
diversities are greater than the < 2 % generally 

20 

considered to comprise a quasispecies. In fact, these 
differences are equivalent to those found between the 
two prototype strains of HCV genotype la [strains HCV-1 
(Choo et al., 1991) and H77 (Yanagi et al., 1997)]. 
25 These results indicated that HC-J6 C h, which represented 
the major species in the experimentally infected 
chimpanzee, was a minor species in the original 
inoculum. 
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Percent difference of nucleotide and predicted amino acid sequences 
between strain HC-J6 (Okamoto et al., 1991) and strain HC-J6 CH from 
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Genome Region 


nt .position* 


% nt. 


difference 


% a. 


a . 


difference 


ORF 


341-9439 


4.1 


(373/9099)* 


2. 


2 


(66/3033)* 


5' UTR 


17-340 


^ 0.6 


(2/324) 








Core 


341-913 


^ 0.5 


(3/573) 


0 


(0/191) 


El - 


914-148 9 


4 . 3 


(25/576) 


2. 


i 


(4/192) 


HVR1 


1490-1570 


24 .7 


(20/81) 


44 . 


4 


(12/27) 


E2-HVR1 


1571-2590 


3.9 


(40/1020) 


3. 


2 


(11/340) 


p7 


2591-2779 


3.7 


(7/189) 


3. 


2 


(2/63) 


NS2 


2780-3430 


4.0 


(26/651) 


2. 


8 


(6/217) 


NS3 


3431-5323 


4.0 


(76/1893) 


0. 


8 


(5/631) 


NS4A 


5324-5485 


4.3 


(7/162) 


1. 


9 


(1/54) 


NS4B 


5486-62 68 


3.7 


(29/783) 


0. 


4 


(1/261) 


NS5A 


6269-7666 


5.4 


(75/1398) 


3. 


4 


(16/466) 


NS5B 


7667-9439 


3.7 


(65/1773) 


1. 


4 


(8/591) 


3' UTR 


9440-9481 


0 (0/42) 









20 



a The nucleotide positions correspond to those of the infectious 

full-length genotype 2a clone (pJ6CF) . 
jb The numbers in parenthesis indicate the nucleotide or amino acid 

differences for each region. 

Example 2 

Chimeric molecular clones 

As chimeric flaviviruses with substituted 
structural genes have been useful in defining the 
biological function of viral sequences or proteins, in 
analyzing immune responses and in generating attenuated 
vaccine candidates (Bray and Lai, 1991; Chambers et al., 
1999; Pletnev et al., 1992, 1993, 1998). The consensus 
sequence of the 2a structural genes and surrounding 
region was substituted for that of the infectious la 
cDNA clone. In the genotype la backbone, two silent 
mutations were introduced for cloning purposes [at 
positions 2765 (p7) and 9158 (NS5B) of pCV-H77C] (Fig. 
4). The complete sequence of each chimera was verified. 
Infectivity of RNA transcripts from four different 
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intertypic chimeric clones (Figs. 4, 5A, 5B) was evaluated 
by consecutive intrahepatic transf ections of a chimpanzee. 
Clones were considered not to be viable if viral RNA was not 
detected in the serum within two weeks of the repeat 
transf ection. All chimeric clones contained the C, El and E2 
genes of genotype 2a. The two chimeric clones tested 
initially differed from each^ other in that one had the p7 
gene of 2a (pH77CV-J6S) and the other [pH77 (p7 ) CV- J6S] the 
p7 gene of la. They differed from the two other clones in 
that the 186 nucleotides of the 5' UTR just upstream of the 
initiation codon were from the 2a genotype. Since neither 
clone containing the chimeric 5' UTR was infectious, the 
chimeric 5' UTR was replaced with the consensus genotype la 
5 f UTR to generate the two p7 varieties [pH77-J6S and 
15 pH77 (p7 ) - J6S] . After consecutive transf ection of the four 
clones, no HCV RNA, anti-HCV or ALT elevation was detected 
in the chimpanzee during 28 weeks of follow-up, suggesting 
that RNA transcripts from these intertypic chimeric clones 
were not viable in vivo. 

20 

This finding that the intertypic clones between 
genotypes la and 2a were not viable was surprising since 
flavivirus chimeras containing the structural region of 
dengue virus type 1 or 2 or of tick-borne encephalitis virus 
2 ^ and the nonstructural region of an infectious dengue type 4 
virus were viable (Bray and Lai, 1991; Pletnev et al., 1992, 
1993) . While considerable sequence variation exists between 
the infectious genotype la and 2a clones of HCV (Table 3), 
these viruses exhibit a higher degree of genetic 
heterogeneity than do the major genotypes of HCV. For other 
f laviviruses, however, it was possible to obtain 
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infectious chimeric clones only if the capsid region was 
derived from the backbone cDNA clone (Chambers et al., 
1999; Pletnev and Men, 1998). 



TABLE 3 

Percent difference of the amino acid sequences between 
the infectious clone of genotype la (pCV-H77C; 
Yanagi et al., 1997) --and the infectious clone of 
genotype 2a (pJ6CF) of hepatitis C virus 



Genome Region 9 


% difference 


Polyprotein 


27 . 9 (839/3007)* 


Core 


8.9 (17/191) 


El 


37 .0 (71/192) 


HVR1 


59.3 (16/27) 


E2~HVR1 


27. 1 (91/336) 


p7 


38 . 1 (24/63) 


NS2 


41.9 (91/217) 


NS3 


19.2 (121/631) 


NS4A 


33.3 (18/54) 


NS4 B 


26.8 (70/261) 


NS5A 


38 . 5 (171/444) 


NS5B 


25 . 2 (149/591) 



a Genome regions defined as in Table 1. 



jb The numbers in parenthesis indicate the amino 
acid differences for each region. 
Positions with deletions or insertions in E2 (4 
aa positions) and NS5A (26 aa positions) were 
not considered. 

Trivial explanations may account for the lack 
of viability of these intertypic chimeras. First, the 
two silent mutations introduced in the genotype la 
backbone (one in p7 and one in NS5B) for cloning 
purposes could potentially eliminate infectivity. This 
is, however, very unlikely since mutations at these 
positions exist among field isolates of HCV including 
strain HC-J6 CH (Bukh et al., 1998). Also, it is 
noteworthy that the three previously published 
infectious clones of strain H77 had numerous silent 
nucleotide differences (Hong et al., 1999; Kolykhalov et 
al., 1997; Yanagi et al., 1997). Second, signal 
peptidases might not cleave the chimeric E2/p7 or p7/NS2 
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junction. This seems unlikely, however, since 
eukaryotic signal peptidases typically recognize the 
amino acid sequences upstream of the cleavage site [the 
(-3, -1) rule] (Nielsen et al., 1997) and the amino 
acids at these two sites are conserved between genotypes 
la and 2a (Fig. 5B) . Finally, the E2/p7 and/or p7/NS2 
gene junctions could differ between genotypes la and 2a. 
The junctions determined for genotypes la and lb were 
used (Lin et al '. , 1994; Mizushima et al., 1994; Selby et 
al., 1994) because those for genotype 2a have not been 
identified. In the latter two cases, further analyses 
of genotype 2a should eventually provide sufficient data 
to overcome such potential problems and it would most 
likely be possible to construct a viable chimera. 
15 More complicated explanations for the lack of 

viability of the chimeras might be required if critical 
genotype-specific interactions occur as regards the 
structural proteins, the nonstructural proteins and the 
genomic RNA. For instance, one cannot rule out that the 
chimeras were not viable because the IRES function was 
compromised. In in vitro studies the IRES activity 
depended on RNA sequences not only in the 5 1 UTR but 
also extending 3' of the translation initiation site 
25 (Hahm et al., 1998; Lemon and Honda, 1997; Reynolds et 
al., 1995). Although the 3' border of the HCV IRES is 
still controversial it is believed to involve at most 
the first 39 nts of the core gene (Lemon and Honda, 
3Q 1997) . The 5' UTR of the intertypic chimeras was either 
a chimera of genotype la and 2a sequences or the entire 
5 1 UTR was derived from the la clone (Figs. 4, 5A) . 
Importantly, the 5 f end of core is conserved among 
genotypes la and 2a (Fig. 5A) . Thus, the predicted 

35 
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IRES-like secondary structure is maintained in these 
chimeras, suggesting that the IRES activity most likely 
was maintained. 

Possible interactions between the structural 
5 proteins and the nonstructural proteins and/or the 

genomic RNA, which involve RNA packaging, replication or 
translation are conceivable.^ In poliovirus, which is 
another positive-sense RNA virus, functional coupling of 
RNA packaging to RNA replication and of RNA replication 

10 

to translation have been suggested (Novak and 
Kirkegaard, 1994 ; Nugent et al., 1999). Similar to 
other viruses of the Flaviviridae family, a membrane- 
associated replicase complex is thought to initiate 
15 replication at the 3 1 end of HCV and to synthesize a 
complementary negative-strand RNA (Rice, 1996) . The 
putative cis-acting elements at the 5' and 3 1 termini 
which are believed to be important for viral genome 
replication (Rice 1996; Frolov et al., 1998) should be 

20 

maintained in the intertypic HCV chimeras at' least in 
the two constructs with the authentic la 5'UTR. 
However, it is conceivable that the viral packaging 
system was interrupted (Frolov et ai., 1998). Studies 
25 using a Kunjin flavivirus replicon system and providing 
the structural proteins in trans suggested that the 
essential encapsidation signals did not reside in the 
structural region of the genome (Khromykh et al., 1997, 
1998) . The location of the packaging signals of HCV is 
not known. However, if the structural proteins 
encapsidate viral RNA via genotype-specific sequences 
outside of the structural region, the chimeras would be 
unable to package the RNA and it might be extremely 
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difficult to construct viable chimeras between highly 
divergent strains . 

Example 3 

A consensus molecular clone of 
genotype 2a is infectious in vivo 

In order to prove that the genotype 2a portion 
used in the 4 intertypic chimeric cDNA clones indeed 
represented the infectious sequence, a consensus full- 
length cDNA clone of HC-J6 C h (pJ6CF) was constructed. 
The core sequence of the T7 promoter, a 5' guanosine 
residue and the full-length sequence of HC-J6 C h (9711 
nts) were cloned into pGEM-9Zf vector using Notl/Xbal 
sites. Within the HCV sequence there were no deduced 
amino acid differences and only 4 nucleotide differences 
(at nucleotide positions 1822, 5494, 9247 and 9289) from 
the consensus sequence of HC-J6 CH as determined in the 
present study. The silent mutation at position 1822 was 
within the structural region and so was also- present, in 
the four intertypic chimeras . The 5 1 terminal 16 nts 
and the 3' terminal 82 nts were deduced from previously 
published HCV genotype 2a sequences (Okamoto et al., 
25 1991, Tanaka et al., 1996). The full-length cDNA clone 
of genotype 2a contained a 5 1 UTR of 340 nts, an ORF of 
9099 nts encoding 3033 amino acids and a 3' UTR 
consisting of a variable region of 39 nts followed by a 
132 nucleotide-long polypyrimidine tract interrupted 
with 3 A residues and the 3' terminal conserved region 
of 98 nts. 

RNA transcripts from pJ6CF were injected into 
the same chimpanzee used for injection of the 4 
35 intertypic chimeras. The chimpanzee became infected at 
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the first attempt with an HCV titer of 10 2 GE/ml at week 

1 post inoculation (p.i.), and 10 3 -10 4 GE/ml during weeks 

2 to 6 p.i. The consensus sequence of PCR products of 
the complete ORF, amplified from serum obtained during 
week 5 p.i., was identical to the sequence of pJ6CF and 
there was no evidence of a quasispecies . Since RNA 
transcripts of this infectious genotype 2a clone were 
infectious zn vivo, and it shared an exact sequence with 
the non-infectious intertypic chimeric clones, their 
failure to replicate must have been the result of 
incompatibilities between 'the genotype la and 2a 
sequences . 

To confirm that the chimpanzee used was 
susceptible also to infection by genotype la, which 
comprised most of the intertypic chimeras, the 
chimpanzee was subsequently inoculated with RNA 
transcripts from the infectious genotype la clone 
(pCV-H77C) . Serum samples were tested in an 
H77-specific RT-PCR assay to identify super-infection 
with genotype la. At week 1 p.i. the total HCV genome 
titer was 10 4 GE/ml and the H77-specific (la) genome 
titer was 10 2 GE/ml. The H77-specific genome titer 
increased to 10 3 GE/ml at week 2 p.i., and reached 10 4 
GE/ml during weeks 3-6 p.i. The consensus sequence of 
PCR products amplified with H77-specific primers at 
weeks 1-6 p.i. were found to be identical to that of 
pCV-H77C. However, the direct sequences of PCR products 
amplified with the 5' UTR primers at weeks 1-2 after 
inoculation of pCV-H77C were identical to that of pJ6CF 
indicating that the 2a genotype was still present and 
represented the majority species. These experiments 
confirmed that the inability of the intertypic la, 2a 
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cDNA clones to infect the chimpanzee was not the result 
of protective immune responses in the chimpanzee but 
represented deficiencies intrinsic to the chimeras. 

Discussion 

The published infectious cDNA clones of HCV 
represent the two most important subtypes of genotype 1 
(Hong et al., 1999; Kolykhalov et al., 1997; Yanagi et 
al., 1997, 1998). However, 5 more major genotypes of 
HCV are recognized. In the above Examples, the 
infectivity of a cDNA clone of a second major HCV 
genotype was demonstrated- As in previous studies, the 
infectivity of RNA transcripts was demonstrated in vivo 
by intrahepatic transfection of a chimpanzee. This new 
infectious clone (pJ6CF) encodes the consensus 
polyprotein of HCV strain HC-J6 C h/ genotype 2a. Its 
encoded polyprotein differs from those of the infectious 
clones of genotypes la and lb by approximately 30% 
(Table 2) . Genotype 2 strains, in particular subtypes 
2a and 2b, have a worldwide distribution and important 
differences between genotypes 1 and 2 with respect to 
pathogenesis and treatment were indicated in previous 
studies • The availability of an infectious clone 
representing a second major genotype of HCV should 
permit new ways of studying the molecular biology and 
immunopathology of this important and genetically quite 
different human pathogen. 

The 5' and 3' UTRs of HCV are believed to be 
critical for viral replication, translation and viral 
packaging (Rice, 1996). The 5' 203 terminal nucleotides 
and the 3 1 101 terminal nucleotides of the published 
infectious clones of genotypes la and lb were identical . 
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However, the sequences of UTRs of the genotype 2a clone 
differ from those of the genotype 1 clones. Overall, 
the 5' UTR of the genotype 2a clone has 17 nt 
differences and a single nucleotide deletion compared 
with the infectious clones of genotype la (Fig. 5A) . 
Five of these differences and the deletion are within 
the first 30 nucleotides, whereas the remainder are 
found within the predicted IRES structure. Differences 
also exist between the 3' UTR of the genotype 2a clone 
and the clones of genotype la (Fig. 5B) . The sequences 
of the variable region are very different. Recent study 
has shown this region is not critical for infectivity in 
vivo (Yanagi et al . , 1999). Within the regions which 
are critical for infectivity in vivo (Yanagi et al., 
15 1999) , the 132 nucleotide-long polypyrimidine tract of 
the genotype 2a clone has 3 unique A residues 
interspersed and the 3' terminal conserved region of 98 
nts has 4 nt differences within the 3' terminal stable 
stem-loop structure (Fig. 5B) (Kolykhalov et al., 1996; 
TanaJca et al., 1996). Since the 2a clone was infectious 
these sequence differences are apparently real and are 
compatible with infectivity. Further studies are 
required to determine whether these represent critical 
25 genotype-specific sequences. 
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WHAT IS CLAIMED IS: 

1. A purified and isolated nucleic acid 
molecule which encodes human hepatitis C virus of 

^ genotype 2a, said molecule capable of expressing said 
virus when trans f ected into cells . 

2. The nucleic arcid molecule of claim 1, 
wherein said molecule encodes the amino acid sequence of 

10 SEQ ID NO:2. 

3. The nucleic acid molecule of claim 2, 
wherein said molecule comprises the nucleic acid 
sequence of SEQ ID NO:l. 

15 

4 . A DNA construct comprising a nucleic acid 
molecule according to claim 1 . 

5 . A DNA construct comprising a nucleic acid 
molecule according to claim 3 . 

6. An RNA transcript of the DNA construct of 

claim 4 . 

7. An RNA transcript of the DNA construct of 

claim 5. 

8. A cell transf ected with the DNA construct 
of claim 4 . 

30 9. A cell transf ected with the DNA construct 

of claim 5. 

10. A cell transf ected with RNA transcript of 

claim 6. 
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-se- 
ll. A cell transfected with RNA transcript of 

claim 7. 

12. A hepatitis C virus polypeptide produced 
by the cell of claims 8 or 9. 

13. A hepatitis C virus polypeptide produced 
by the cell of claims 10 or ^11. 

14. A hepatitis C virus produced by the cell 
10 of claims 8 or 9. 

15. A hepatitis C virus produced by the cell 
of claims 10 or 11. 

15 16. A hepatitis C virus whose genome 

comprises a nucleic acid molecule according to claim 1. 

17. A hepatitis C virus whose genome 
comprises a nucleic acid molecule according to claim 3. 

18. A method for producing a hepatitis C 
virus comprising transfecting a host cell with the RNA 
transcript of claims 6 or 7 . 

25 19. A polypeptide encoded by a nucleic acid 

sequence according to claim 1. 

20. A polypeptide encoded by a nucleic acid 
sequence according to claim 3. 

30 

21. The polypeptide of claim 19, wherein said 
polypeptide is selected from the group consisting of NS3 
protease, El protein, E2 protein or NS4 protein. 
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22. The polypeptide of claim 20, wherein said 
polypeptide is selected from the group consisting of NS3 
protease, El protein, E2 protein or NS4 protein. 

23. A method for assaying candidate antiviral 

5 ... 

agents for activity against HCV, comprising: 

a) exposing a cell containing the hepatitis 

C virus of claims 16 or 17 to the 

candidate antiviral agent; and 

10 b) measuring the presence or absence of 

hepatitis C virus replication in the cell 

of step (a) . 

24. The method of claim 23, wherein said 

15 replication in step (b) is measured by at least one of 

the following: negative strand RT-PCR, quantitative RT- 
PCR, Western blot, immunof luoresence, or infectivity in 
a susceptible animal . 
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25. A method for assaying candidate antiviral 
agents for activity against HCV, comprising: 

a) exposing an HCV protease encoded by a 
nucleic acid sequence according to claims 
1 or 3 or a fragment thereof to the 
candidate antiviral agent in the presence 
of a protease substrate ; and 

b) measuring the protease activity of said 
protease. 

26. The method of claim 25, wherein said HCV 
protease is selected from the group consisting of an NS3 
domain protease, an NS3-NS4A fusion polypeptide, or an 
NS2-NS3 protease. 
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27. An antiviral agent identified as having 
antiviral activity for HCV by the method of claim 23. 

28. An antiviral agent identified as having 
antiviral activity for HCV by the method of claim 25. 

29. Antibody to the polypeptide of claim 19. 

30. Antibody to the polypeptide of claim 20, 

31. Antibody to the hepatitis C virus of 

claim 16. 

32. Antibody to the hepatitis C virus of 

claim 17 . 

33. A method for determining the 
susceptibility of cells in vitro to support HCV 
infection, comprising the steps of: 

a) growing animal cells in vitro; 

b) transfecting into said cells the nucleic 
acid of claim 1 ; and 

c) determining if said cells show indicia of 
HCV replication. 

34. The method according to claim 33, wherein 
said cells are human cells. 

35. A composition comprising a polypeptide of 
claim 19 suspended in a suitable amount of a 
pharmaceutically acceptable diluent or excipient. 



36. A composition comprising a polypeptide of 
claim 20 suspended in a suitable amount of a 
pharmaceutically acceptable diluent or excipient. 
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37. A composition comprising a nucleic acid 
molecule of claim 1 suspended in a suitable amount of a 
pharmaceutically acceptable diluent or excipient. 
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GQCftGQOOOC 1GMX3353GC GfteaCIOCAC CATCAAICAC TOOX'IUIUA. 50 

QGAftCIftCIG TCITCACGCA GAAAGG3ICT TD^SIKEGftG 100 

T Gimm c ft G ocieraoGAC (xxxreiar oosBGAQocft. aacnMb ium 150 

OQGAAOGQGT GRGIWM3 GAftTIQOCftG GAQ3ADCX333 1 CC1T1LT1U 200 

GKEAAAOOOG CICAATOCCT GGAG^iTlUJ GOGflGOODOC QCAflGftCIQC 250 

IBGOOGftGIA GIGTIQ3SIC GO3AAAG30C TIGlGSEftCr GOCIGMffiGS 300 

GIQCTIQOGA. GTGCUXUGS AGG'ICIOSIA GAOJG'lUCflC CATCfiGCaaS 350 

AASOCTAAAC CICAAfiGAAA AftCCAAACCT AACAOCAADC GIOQOOC&Cfc 400 

GGAOSICAftG TIOCOX G IG QCQ3ICAGAT CL,T1UJ1UA GTlTACl ' mi ' 450 

TOGOQCQCAG GGGCX'iiAGA, TIQ3SIGIQC Q3303A03flG GAftGft LTlUL ' 500 

GftGQaSIOGC AA02T03AG3 TftGAOSICBG OCTAIOGCEA AQaCAOGICG 550 

G3XX2M333C AGGftUC' l CJG G CTCAQCOQQS GTB LULTIUj OOX'IL'IAIG 600 

GCAA1GAGG3 TIGCGGG' I GG QCX393ATO3C TOCIGICTCC OXf l OjClL T 650 

OQQOCIBGCr GQGGOOOCAC AGAOOCCOQS CXJEAGGTOGC OCAATnOOS 700 

TAAGGICATC GATADOCTIA CGIQ033CTT CGCD3ACCKZ 750 

TACCGCTOGT CGG0GXQCT CITOGPO30G CIQ0CW333C CCIOJ33CAT 800 

Q30GICO333 TTCIGGAftGA. O330GIGAAC TMQCAAC&G GGAAOCTCOC 850 

T3GIT3CTCT TlUiL'imLT 'lULTlCiUGC OCTGCICICT TOCCTGACIG 900 

TQOOCGCTIC AGQCEAOCAA. GIQOQCAA2T CCICGGGGCT TEACCATCIC 950 

TOCAAIGftTT GQQCTAACIC GAGTATTOIG TAG3AG3333 OQGftHQOCAT 1000 

OCIQCACACT O0Q333IGIG TaXTTOOGfT TOQ0GAG3GT AAD3QCIQGA 1050 

OjlUl'lUGbT GQQ03IGAOC CXXHAOQGfIG3 OCACCAG3GA OQQCftAACIC 1100 

CCXZACAAOQC AQCITD3A03 TCAXA2CGAT L'lULTlUlLU G3M30G0CAC 1150 

CJL ' I UIU L ' ILU GCECTCTACG TOQQQGAOCT G'iULUJi'ICT GICITIUITG 1200 

TIGGflCAACT GTimJUriU '1U1UOCAG3C GOCACIOoftC G7«33ZAAGAC 1250 

TOCAATIUIT CiML ' imUJ OQQCX^aTOA GCAIOXMG 1300 

ATCAACIGGT CXXJLTAOQQC AG0GTIQ3IG GFPGCICPGZ 1350 

TGCIOOGS A T CXX2£AAG3C 1GAIOCT3G 1QCICACIQ3 1400 

GGftGIOCIGS OQQ3ZATM3C G ' imTlUlUJ MG3I0333A 1450 

GG ' IUL ' IGGTA . GIOJIUL'IUC TA liTl UULUj 03TOGft03CG GAAaOOCftOG 1500 

1CAOCGQ333 AAftlQOOQQC aXSOCACQS CiGUiJl'lGT 1UJ1U1ULT1 1 1550 

ACAOCAG3GG GCMGCaGAA CATCCAfcCIG AICAACftOCA. ACQ33GTIG 1600 

QCACMCAAT AQCACQ30CT TCAATIQCAA, TGAAAQQCTT AftCWD03XT 1650 

QGTEAGCAQ3 QCICTICIKr CAAC&CAAAT TCAACIUTIC AUJL'IUIULT 1700 

GAGPiQGTIGS OCM3CIQOQC3 ADQOCTTWDC GKHTIGOOC ^mUL'lUJGG 1750 

TOCTMPCAGT TKIQOCAAQG GAAGOQQQCT CGAOGAW33C OOCDOQCT 1800 

QQCACIBGQC TOCAftGftGCT TGIQSCAlTIG TO0003CAAA GPUUG'iG'lUT 1850 

GQQQOQGTAT ATIQCTICAC TCOCAGQOOC G'iUJIUJiGS GAACGAOCGA 1900 
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CA03IO333C GCGOCIAGCT 70030333 TOGAAA1GAT AQ33AID3ICT 1950 

TOGIDCnAA OttCAGCAGS OCAQ33CIQ3 GCAATK33TT O^riUllft OC 2000 

TOGAIGAACT C&ACIGGATT CAOCAAAGIG TO033AG33C OCXXTTOIGT 2050 

CA3033AG33 GT333CAACA ACAULT1ULT CIG33QCACT GATIG3TICC 2100 

GCAAACATOC G3AAG0CACA. TACICTO93T 003330333 TGQdGaATT 2150 

ACAD3CAG3T G3A1Q3T03A. CIA0333EAT AG3CTTK33C AL'IMULTll* 2200 

TAOCAICAAT 'COOCATAT TCAAAGTCAG GATCEAD3IG G3AQ333I03 2250 

AGCACAG3CT G3AAG3330C T33AACIQ3A G33333333A AD3CIGIGAT 2300 

CIG3AAGACA. G33ACAG3TC 03AGCTCAGC 033TIGCT3C TGIOCAOCAC 2350 

ACAGIG3CAG GTOCTTOCJ3T GTIUITIUAC GAUUL'lUOCA. ULLT1U1ULA 2400 

O3330CTCAT OCADCIOCAC CAGAACATIG T33A0GTGCA. GEALTiUUAC 2450 

G333TAG33T CAAQCA303C G1LC1U333C ATEAAGIQ33 2500 

lUlUL'lUl'lC CriCIGCTIG CAGA03a303 GGIC1GCIGC T3CTIGIG3A. 2550 

T3AIGITACT CA3XP33CAA GCG3AG3033 CTTIG3AGAA OCID3TAATA 2600 

CICAAIGCAG CATOOCIGGC 0333AD3CAC G3ICTIGIGT ULTlUL'iUbT 2650 

GTILT I L ' HX ' TIT333TG3T A3UIGAAG33 TAG3IG33TG G00Q3AG333 2700 

TCEADQQOCT CEAD3QSAIG TOQCCICIOC '1CC1UCIOCT GCIG303TIG 2750 

OCTCAGCX3QG CATAD3CACT G3ACAD33AG GICGCCGGGT GGIG1GXG3 2800 

CGl ' lUl ' lL ' lT GTO333TTAA. TO333CIGAC TCIGTD3CCA. TATEACAAGC 2850 

GCTAIATCAG CIG3IQCAIG T33IG3CTIC ab'lMlTlLT GAOCAGAGTA. 2900 

GAAG0QCAAC 1GCA03IG1G yyi'lUUUUUL' CICAAGGTOC 0333333333 2950 

O3ATOG03IC AICTI^CICA. TCIGTGTBGT ACAG003AGC CTG3IMTIG 3000 

ACATCACCAA ACI^L'ICCIG QOCASCTICG GACUUL'l'l'lU GATICTICAA 3050 

QQCAGITIQC TIAAAGTOOC CIBCTICGIG 0333TICAAG (JULT1U1ULU 3100 

GATCIG090G CTAQ3333GA. AGAIAGC033 A33TCATEAC G1GCAAAT33 3150 

OCATCATCAA. GirAG33333 CITACB33CA UL'imUlUlA TAAOCATCIC 3200 

AOCJQCICnC GAGACIQ33C GCACAA033C CIG03AGAIC 'lUJULUlUX: 3250 

T3IG3AAOCA. GIOSICTICT C033AATO3A GACCAAGCIC MCA03TG33 3300 

G33CAGATAC 0330333TQC G3IGACATCA, TCAA033CIT G0003ICICT 3350 

GO00GIBG33 GGCAGGAGAT TOGCTIG33 OCAGCOGACG GAA3Q3ICIC 3400 

CAAG333TG3 AQGTIGCIG3 03332ATCAC 0333TA0333 CAGCAGAOSA 3450 

GAQ3CCID3T A033IGTATA. ATCACCAGOC TCACIG3333 GGACAAAAAC 3500 

CAAGIGGAG3 GIGAGG I OC A GAIUUIG'I CA . ACIGCTAOOC AAAOJl'lUCT 3550 

G3CAAOGHX3C ATCAATO333 TA3GCIG3AC 'lUILTADCAC G3333333AA. 3600 

O3AGGA0CAT CGCA3CADCC AAG33TCCIG TGA3CCAGAT GEA3AOCAAT 3650 

GTGGAOCAAG AOCTIGIG33 C 1 UUUU0UL T OCICAAG3TT OQ03CICATT 3700 

GACAOXTGT ACCIG033CT OCI033ACCT TUOI'IUJIU AOGAQ3CA03 3750 

G3GA3GICAT TOQOGflGOGC CQ303AQ3IG ATAGCAG333 TAG3CIGCIT 3800 

FIG. 6B 
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TO3COXGGC CCKlTiUL'lA CTIGAAAGGC TCCIU33333 GHOJLJL ' IUJLT 3850 

GIGGGCO30G GGACACGOQ3 TO33QTEATT CN33333333 GIGIGCAOOC 3900 

GTOGAGIGGC TAAAGCGGIG GAL'iTIMOJ CTGIQ2AGAA. CCB\G33ACA. 3950 

AOCA3GAGAT CCXXX33TCTT CAGGGACAAC TCCIC'ICCAC CAGCAGIGOC 4000 

CCAGAGZTIC C7Ann?IG3Cr3C AGCIGCA3GC TODCAC033C AG333TAAGA. 4050 

GCAQ2AAGGT GQ033CIGO3 raCGCAGOOC G amriUUm 4100 

CICAAQ0QCT CtUriGC ' lGC ■AAUJC'IGGGC ' ITlUJiULUT ACA !iUlLU A 4150 

GGOOCASGGS GTIGKPDCm ATMCAGGAC 0333G7TGAGA ACAATEACEA 4200 

C1GGCAGO0C CATCAGGTJAC TOCAOCTAGG GCAAUT10LT T3033A033C 4250 

G33IQCICAG GALU1ULT1A TGACATAAEA ATTIGTGAOG AGIGUCACIC 4300 

ACATOCATCr TGG3CATCG3 CACIUIULTI' GAOCAAGCAG 4350 

AGACIGQ3GG GGGGAGACIG GT1U1GCICG CCAC'lUL'i^C O0Cim333C 4400 

TOOGICACTG TGTCOCATCC TAACAT33AG GAGGTIGC'IC T3IOCACCAC 4450 

03GAGAGATC OOCITITACG GCAAGQCTAT OOOOCTOGAG GT3ATCAAG3 4500 

G33GAAGACA TCICK1L.T1C TOCCACIOA AGAAGAAGIG OGAGGAGCIC 4550 

GDGQQ3AAQC TOGTCGCATT G33CATCAAT QQ0GIG30CT ACI^CCQGGG 4600 

TCTIGAOGTC TCIGICATCC 0GAOCAG03G OQKIUI'IUIC GIOGTGIOGA 4650 

ODGAIQCICT CATGACIGGC TITACCGQOG ACTIC3ACIC TCIGAIAGAC 4700 

TGCAACAD3T GIGICACICA GACAGTOGAT TICAQQCTIG AQQCT&CCIT 4750 

TAOCATIGAG ACAAGCAGGC TOOOOCAGGA T37IGIUIOZ AG3ACTCAAC 4800 

G203333CAG GACIGGCAGG QGGAAGOCAG GCATCEATAG ATTIGIG3CA 4850 

O09333GAGC Q3XCIC033 CA3GTIOGAC TDGTEOCXjIQI TCIGTGAGTG 4900 

CEATCADXG G3CIGTGCIT GGTJAIGAGCT CAOQ3003QC GAGACTACAG 4950 

TTAG3CTEACG AGCGTACATC AACAOX033 OJLTlUUUbT GiOJCAGGAC 5000 

CA3CTIGAAT TTIGGGAGGG CGTCITISGG GGOCICACTC A2AIAGA3GC 5050 

OCAL'i'l'l'l'lA TDOCAGACAA AGCAGAGT33 GGAGAftCTIT OCTEAOCIGG 5100 

TAQCGTAOCA AGOCAGCGTG '1UUUCTRG33 CICAAGOOOC TOXOCATOG 5150 

T333ACCAGA T3TGGAAGIG TITGATCCDC CrTAAAOOCA GCCIOCA1GG 5200 

GCCAACACGC CIGCEATACA GACTG3333C TGTICAGAAT GAAGICAOQC 5250 

1GAO3CAO0C AATCAGCAAA TACATCA1GA OOGZAIGIC G3C0GAOCIG 5300 

GAGGICGICA GGAGCAGCTG GGIU- ' IUUIT tfJOaCOIPC TG3CIGCICT 5350 

G33CGGGTAT T3QCIGTCAA CAGGCIGOGT UJICATAGIG GGCAGGATOG 5400 

TLT 1 U1 U U05 GAAGGOGGC A ATTJOaOCIG ACAQ33AQGT TCTCTAOCAG 5450 

GA Gl ' lLG AIG AGA3GGAAGA GTGCTCTCAG CACTEAGCGT ACATOGAGCA. 5500 

AQQGATCATC CiaOCTGAGC AGTICAAGCA GAAUJLUL'iC GGOCTOCTGC 5550 

AGACCGCGIC CCGCCAJGGA GAGGTTATCA GGOCIGCIGT CXZAGAGCAAC 5600 

1Q0CAGAAAC TOGAGGTCTT TIGGGGGAAG CACAIUIGGA. ATTICATCAG 5650 

T333ATACAA TACTTGGOGG GCCTGICAAC GCIGOCIGGT AAG0003XA 5700 

FIG. 6C 
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TlULTlUttiT GA3X33CTTIT PCM3CUJOOG TCA02AG33C ACIAAOZACT 
G30CAAAOGC TOCICTTCAA, CAEATIG333 G33IG33IG3 CTCOJJAGCT 
(JULIUXCCC G3IGG0OC3A ClUUL'lTlUr G33TGCIG3C CEAGCTO333 
GOQQCMCGG CAQCGITO3A CIG333AAG3 T33IOGIG3A CATICTIGCA 
G33TA1G3CG 033333IG3C Q33AQL'1L'1T GTAGCATICA AGATCA2GAG 
CGGIGAG3IC (3X'K)CACQ3 AQGfiOdQST CAA3CIGCIG 03333CA3O: 

• l u i uujL'ia^ AQOocriGm giosstgigs 'iciGOGCftGC Awracrooac: 

GGQCAOSnG G33333333A. G3333CAGIG CAAIQGMGA A3333CIAftT 
AGOCTICGOC TOCX333333A. ALXJAIIUITIU CODGA032AC TACGIG0333 
AGAG03A2GC A33333333C GTCACIGXA TOCICAGCAG OCTCACIGm 

AOOCAonac tgaq333act gcatcagigs ataaoitogg agtceaccac 

TCCA3X3CTCC G3TIDCTG3C TAAQ33ACAT CIGQGACIG3 ATATO33AG3 
TGCIGAQOGA Ci ' l ' -LA AGAOC T33CIGAAAG OCAAGCICAT GCCACAACTG 
OCIQQ3ATIC ULWIUIL' CTG0CAG03C: G33EATAG33 G33ICIG333 
AGGAGA033C ATIATOZACA CID33IGXA CIGIG3AGCT GAGA3CACIG 
GACATCICAA AAA0333AGG ATGAGGATOG 'ILUJILL'iAG GAOC7IGCAG3 
AACAIGIQGA GIG33A03IT OCCCATE^AC GOCTACAOCA O333CC0C1G 
TRL' IU UUL'IT CCTOCGOQGA. ACTATAAGIT OXGL'IGIGG AG33IGICIG 
CAGAG3AATA. 03IQGAGA3A AG30333IGG GSGftLTllXA CEAOGTATCG 
GGTAIGACTA CTCACAAICT TAAATOXC3 TO3CAGATOC CAiaSQCCGA 
ATITnCACA GAATIG3ACG G33IGCG3CT ACACAGGTIT UCUUUUULT1 1 
GCAAGCCCIT G3IG3333AG GAGGTCATCAT TCAGAGI703 ACTOCAQGAG 
TACQCG3IG3 GGflCGC A ATT AOCITGOGAG <3333AA0333 ACGTJ£3333T 
GTIGACGTDC ATOCICACIG MCCCICCCA TATAACAOCA GAQ3333333 
G3AGAAG33T G303AGAGQG TGMJLXJUL'IT C3KD33QCAG CIOCICD3CT 
AGOCAGCIGT OGQCTOCATC TCICAAG3CA ACTIGGAOOG OCAAOCAIGA 
CTOCGCIGAC G333AGCICA. TAGAG33CAA OCTOCIGTG3 AG3CZAQGAGA 
TQQ3333CAA CATCACCAG3 GTIGAGTCAG AGAACAAAGT G3IGATICIG 
GACTGCnOG A lUUGUl ' lG T Q3CAGAG3AG GAIGAGCG33 AOGICIOOGT 
ACCIGCAGAA ATICIQ033A, AGTC1D33AG AT3XX333333 QJUU IUUODG 
TCIG33QQ0G G333GACTAC AAQQCXXCGC TAGTAGAGAC GIGGAAAAAG 
CCTGACTACG AAOCAQCIGT G3TGCAIG3C 'lULU JJL ' lflC CAuJlU^AOS 
G IU JL C ' I CIT GTGCCroCGC CI033AAAAA G03TW333IG l»i u\,11£GQG 
AATCAAGQCT ATCEftCIGQC TIG3CCGAGC TIQOCAGCAA AAGTTTIG3C 
AGCTTICICAA CITO333CAT TAOQQQCGAC AATAOGACAA CATCCICTGA 
GCCCQQQCCT 'I C IGJL'IUUC GQ000GACIC CGACGTIGAG UUL'iM'lL'JLT 
OCA3G0QQ0C 0GIG3AG333 GAGQCIG333 ALLCCGGAICT CAGOGAGQQ3 
TCMGGIOGA CGGTECAGTAG T3333003AC AOQGAAGATC T33TGTGCIG 
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CTCAASGTCT •DVTIUCIUGA CAGG332ACT O3TCAGQ00G TO03TB3333 7650 

AAGAACAAAA. AC1G3XATC AA03CACIGA. (EAACIOSIT GZCA03XAT 7700 

CACAATCTG3 TGEVTIDCAC CACTICA03C AGIULTIUUU AAAG3CAGAA. 7750 

GAAAGICACA TITCACAGAC TOZAAGflTCT GGACAG3CAX TA02AG3ACG 7800 

TCCIC A AQGA, GG1CAAAGCA. GCX3333ICAA AAGIGAAG3C TAACT I UL'IA 7850 

TOOGrlRGAGS AAGCITGCAG CCIGACX33X (XACKTICAG OCAAATOZAA 7900 

WiTlUJOaT GGGGCAAAAG AOSIOOGTIG OCAroOCAGA AAQ3XEIAG 7950 

COCACATCAA CIO03IGIG3 AAAGAfJLTJAJ TGGAAGACAG T3IAACAOCA. 8000 

ATAGACACTA OCA2CA3GGC CAAGAA03AG UlTl'lUlUOS TIC A UUU1L A 8050 

GAAGOQ333T 03TAAGGCAG CIOGIUICAT O3TOTIO00C GACCIQQ303 8100 

TO030GIGIG CGAGAAGA3G GOOCTOiaOG AOGflGGTIRG CAAGCIOJUL' 8150 

CIQ300GIGA TGGGAAGCIC CEAGQGATIC CAATACICAC CAG3ACAQ03 8200 

QGTIGAATIC CTOGIGCAAG CGTGGAAGIC CAAGAAGACC O0GA3Q333T 8250 

TCIQ3IKEGA. TAOOOGCIGT TTIGACIDCA. CAGTCACIGA. GAQQGACATC 8300 

OSTAOQGAGG AQQCAATITA. CCAAlUl'lUl' GACCIGGACC C0CAAGO00G 8350 

OGTO30CATC AAGUXCICA. CIGAGAG3CT TTPTCTIG33 G3XC1LT1A 8400 

OCAATICAAG GQQGGAAAAC TG033OTCC GCAGGTOOQG CQCGAQQ332 8450 

GEACIGACAA CIAGCTGIGG TAACACCCIC ACTIGCTACA. TCAAGQ000G 8500 

G3CAG0CIGT GGAGQOGCAG GOCTCCAGGA. CTOCAC3CATC CIOGIG'IUIU 8550 

GOGAOGACTT AGfroGTTATC T3IGAAAGIG OQ3333TDCA. QGAGGACQCG 8600 

GOGAQQCIGA. GAGOCTICAC GGAQ3CIATC ACCAG^rACT COaOXOCCC 8650 

OGQ33ACQQC OCACAAQCAG AA2AOGACTT GGAGCTTATA. ACATCAIGCT 8700 

CCBXAAOGT GTCAGTOQQC CAGGA03333 CIOGAAAGAG G3ICTACIAC 8750 

CTTAO003IG AQQCEACAAC CXXCCT3303 AGAGCOQOGfT G3GAGACAGC 8800 

AAGACACACT CXZAGICAATT 0CIG3CEAG3 CAACAIAAIC AIGTTTOOOC 8850 

0C3OOGIG GGOGAGGATC MBCTGA3GA. COCATTIUIT TAGOQICCIC 8900 

ATAGOCA03G ATCAGCTIGA. ACAG3CICTT AACIGTCAGA. TCEACX33^3C 8950 

CIGCTACIDC AIAGAACCAC TGGA3CIAGC TOCAATCATT CAAAGACTOC 9000 

ATOGOCICAG OXATinCA CIOCTOmT AC1CIUCAGG TGAAA3CAAT 9050 

A33GTOQCJ0G CATOOCICAG AAAACTIGGG GTOQOGOOCT 1GOGAGCITG 9100 

GAGACAC003 GO0QQGAGGG T00303CTAG GCTTCIGIOC AGAQGAQ3CA. 9150 

G39CTG0CAT AIGTGGCAAG TAOCICTICA ACIGGGCAGT AAGAACAAAG 9200 

CTCAAACICA. CIXDCAATAGC COXGClOa: GGGCIGGACT TCIG0QGTTG 9250 

GITCAOQXT QGCTACAQOG G93SAGACAT TTATCACAGC GTCICICATC 9300 

O00330000G CIQGTICIGG TITIQOCIAC TOCIGCI03C TGCAGQQGrTA 9350 

GGCAICiaCC TCCBXOCAA O0GATCAAG3 TIGGGGTAAA CACKX33QOC 9400 

TCTEAAOOCA Tl'lU'MTi' Tl ' l ' l ' l ' l ' l ' l ' lT TlTl ' lTl ' lTr Tl'i'l'iUl'l'l'r 9450 

Trmncrr tocttiocit crrrnna: TriLTimi' utiltita a 9500 
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TOGIGOCKr ALLL'lTAGGCe TAGTCACG3C TM3CT3IGAA AQGTO3SIGA 9550 
QOQQCAIGAC TOCAGftGAGT QCIGAIACIG O Xld C'lUC AGATCAIGT 9599 

FIG. 6F 
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MSINFKPQRK TKFNINRRFQ UJKFPQOGQT VGGVYIiPRR GPRLGVRATR 50 

KTSERSQPRG FRQPIPKARR PB30WAQPG YPWPLYQSEE CGWAGWLLSP 100 

PJ3SRPSWGPT DERRFSRND3 KVHJTLIOSF ATXM3YIPLV GAPD32AARA ISO 

IAH3VRVLED GVNYATCNLP GCSFSIFLLA LLSCLTVPAS AXQVRNS9GL 200 

"YHVINDCENS SIVXEAAEAI LHTPGCVFCV RB3MASFCWV AVIFIVATRD 250 

GKLFTOSLiSl HTEUjVGSAT LCSALY\A3Xi CESVFL\02L FIFSFRRHVT 300 

TQDCNCSIYP GHTIGHRMBW EM**JWSFIA ALWAQLLRI PQADCMEfiG 350 

AHWGVIAGTA VUJVLLLFM3 VD&EIHVIG3 NAGKi'I^GLV 400 

GLLTPGAK3* IQLININGSW HINSTALNCN ESLNIGWEAG IfTQHKFNSS 450 

GCPEKLA9CR RLOTAQGWG PISXaNGSGL. EBF?PXCS«HYP PRPOGIVPAK 500 

SVOGPVYCFT PSPWVGITD RSGAPTYSWG ANDIEVFVLN NIRPPLGNWF 550 

GCLVWNSIGF TKVCGAPPCV CFIDCFRKHP EAIYSROSSG 600 

PWTTPRCMVD YPYRLWHYPC TINCTIFKVR MYVG3VEHRL EAACNWIRGE 650 

RCDLECRERS ELSPLLLSTT QWQVLPCSFT TLPALSIGLI HLH3SIIVEWQ 700 

YLYGW3SSIA SWAJKWEYW LLFLLLAEftR VCSCtJ^WLL ISQAEAALEN 750 

LVHNAASLA GIB2LVSFLV FFOTAWYLKG EWVFGAVYAL YGMWPLLLLIj 800 

IALPQRAYAL DTEVAAS033 WLWGLZ^LT LSPYYKRYIS VOS^JWDQVFL 850 

TRVEAQLHVW VPPLNVRQGR n& VTT ,T M "W HPILVFDTTK T.TJ AIFGFLW 900 

HJQASLLKVP YFVRVQGLIil ICALARKIAG GiYVQMAIIK D3MJIGTWY 950 

NHLTPLJ^DWA HNGLPDLAVA VEPWFSFME IKLTCWGADT AAOGDIINGL 1000 

FVSARRQQEI LLGPAEQWS KGWKLLAPIT AY7^QQTRGLL GCIITSUEGR 1050 

DKNQVEGEVQ IVSTA.TQTFL ATCINGVCWT VYH3AGIKIT ASPKGPVIQM 1100 

YINVDQDLVG WPAPQGSRSL TPCTCGSSDL YLVIRHADv/I PVRRRGDSRG 1150 

SLLSPRPISY IKGSSQGELL CPAGHAVGLF IWMZDRGVA KAVEFIPVEN 1200 

LGTIMRSPVF TTNSSPPAVP QSFCMAHLHA PTG9GKSIKV EAAYAAQGYK 1250 

VLVLNPSVAA. TIGFGAYMSK AH3VDPNXRT GVRiTl'IGSP HYSKGKFL 1300 

ADG3C9GGAY DIIICDECHS 1DATSHJSIG 1VLDQAEIRG ARLWLAIfcT 1350 

PPGSVIVSHP NIEEVZ^LSIT GELPFYGKAI PLEVIKQGRH LIPCHSKKKC 1400 

EEIAAKLVAL GINAVAYYPG LD/SVIPTSG EVWVSIDAL MlUr'lUUbDS 1450 

VUXNICVIQ TVDFSLDPTF TIETITLPQD AVSRTQRRGR TGR2CPGIYR 1500 

FVAFGERP9G MFDSSVLCEC YDAGCAWVTEL, TPAETIVRLR A5MJTPGLPV 1550 

CQEHLEFWEE VFIGLTHIEA HFLSQTKQ9G ENFPYLVAYQ ATVCARAQAP 1600 

PPSWDQMrtKC LIRLKPTLH3 PTPLLYRLGA. V^SEVITiTHP I7IKYIMICMS 1650 

ADLEWTSIW VLVGGVLAAL AAXCLSIQCV VIVGRIVLS3 KPAIIPEREV 1700 

LYQEFT3EMEE CSCHLPYTJEQ GMMLAEQETCQ KATfTTJQIAS RHAEVETPAV 1750 

QINW3<LEVF W&KHMWETS GIQXIAGLST LPGNPAIASL MAFTAAVISP 1800 

L TJ.Gfli.U jFN UJ3QWWAAQL AAPGAATAFV GAGLAGAAIG SM3LGKVLVD 1850 

XLAGYGAGVA GALVAFKIMS GEVPSIEJXV NLLPAUjSPG ALW3V\A2AA 1900 
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HHFHVGF3E GfiVQWWRLI AEASRGNHVS PTHYVPESDA AARVTATJLSS 
L1VIQLLRRL H3WISSECTT PC9SSWLRDI WDWICEVLSD FKIWLKaKIM 
PQLPGIEFVS GQRGXFGW\IR. GDGBMIPCH CGAETIGHVK N3IMRIVGPR 
TCTOMtfSGIF PINAYITCPC TPLPAPNYKF AIWFVSAEEY VELKKVGDEH 
YVSGMITENL KCFCQIPSPE FFIELDGVRL HREAPPCKPL LKEEVSFKVG 
LHEYPvASSQL FCEPEPDtfAV LT34L1DPSH riAEAAGRRIj ARGSPPSMAS 
SSASQLSAPS LKATCEANHD SEDAELIEAN LIWRQEM3GN ITKVESENKV' 
VELDSFDPLV AEEEEREVSV PAEHLRKSRR EARALPvV&R PDYMPPLVET 
WKKPDYEPPV VB3CPLPPPR SPPVPPPRKK RIWLIESTIj STALAELA1K 
SPGSSST9GI TGCNTTISSE PAPSQCPPDS EVESY5SMPP T .M.a.!U jpH 
SDGSWSIVSS GADIHJvAKDC SMSYSWIGAL VTFCAAEEEK LPINALSNSL 
LRHHNLVYST TSRSAO^QK WTETKLQJL DSHYGEHLKE VKAAASKVKA 
NLLSVEEACS LTPPHSAKSK P3XGAKEVRC HARKAVAHIN SVWKELLEDS 
VTPiJJl'l'lMA. KNEVFCVQPE KQ3RKPAELI VFHXGVRVC EKM&LYDWS 
KLPLAVM3SS VEFLVQAWKS KKTPM3FSYD IFO-Ub'lVTE 

SDIKTEEAJY QQCDLDFQAR VATJCSLTERL Y\A3GPLHSBR GENQGXRRCR 
ASGVLTTSCG NIKECYIKAR AACRAAGLQD CIMLVOGEDL WICESAGM2 
EDAASLRAFT EAMIHYSAPP GDPPQPEYDL ELTT9CSSNV SVAHDGAGKR 
WYL1RDPTT PLARAAWETA RHTFUNSrtLG NIIMFAPTIJW ARMULMIHET 
SVLIARDQLE QALMZETYGA CYSIEPLDLP PIIQRIH3LS AFSLHSYSPG 
EINRVAACLR KLGVPPLEAW RHRARSVRAR LLSRQGRAAI OGKXLFNWRV 
RIKLKLTPIA AAGRLDLSGW FEAGYSQGDI YHSVSHARPR WR^FXZLLLLA 
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QCEM30Q00C TCA3Q33GGC GACACIQ3C CKIGAATCAC TOGOC3GIG& 50 

GGAACTACIG TCITCAOQCA GAAAG03ICT AGGCATOGCG TIAGTATCAG 100 

■jGiaaiGcaG (xtocaqgac ccccociax gggagagoca tbgjkgiciu 150 

CQGAACCGGT GAGTACAOQG GAAIIGCCAG GAO3AG0333 'lUJlTlLTlU 200 

GATCAAGOOS CICAATGOCT GGAGATITO3 GOGTODDCCC GCX3AGACIGC 250 

TAGOCGAGTA GTGTIGGoIC GOGAAAGQOC T1U1GSUACT QGL'IGKERGG 300 

GTOCTIGGGA GTGQOXX3C3S AQoIUIUGTA GAGOGTGCAC CATCAGCACG 350 

AATCCTAAAC CICAAAGAAA AACGAAAOGT AACAGCAAGC GO0Q00CACA 400 

Q3AQGICAAG TIC00GGG0G GTOGICAGAT CUTimiOA GTTIAQCICT 450 

TQQOQQGCAG GGQGCCCAGG TlUU mG lGC: GOGOGACTAG GAAGGCTTOC 500 

GAGOGGTOQC AACCT03IG3 AAQGGGACAA. CCTATOOCAA AGGCTOGCOG 550 

ACOOGAGQQC AQQQOCTGGG CICAG00033 GTACGCTIGG UCOJIL'UATC 600 

GCAATCAGGG OCIGGG3TQG QCAGGATOGC TCCIGTCACC OOGOGGCIOC 650 

CQQCCEAGTT G3GQ000CAC GGAOOOOOQG OGIBGGTOQC GTAACTIGG3 700 

TAAGGTCATC GATACOCITA CATGOGGCIT O3C0GATCIC ATGGGGTACA 750 

TTCOGCTOGT OGGOGCOQCC CEAGGQGGOG CIGOCAGGQC CTIGGCACAC 800 

GGTGTOOQQG TICIQGAGGA OGGOGTGAAC TATGCAACAG GGAACTIQOC 850 

CX3GTIGCICr ITCICTftJCT TCCICITOGC TCIGCTOTOC 1GTTIGAOCA 900 

TCGCAGCTTC CGCTEATGAA GIQ0GCAAO3 TGIOOGGGAT ATAGGATCIC 950 

AOGAAOGACr GCTOCAACIC AAGCATIGTG TATCAG3CAG OGGACGTGAT 1000 

CAT3CATACT C00GQGTG33 TGOOCIGIGT TCAGGAQGGT AACAGCTGOC 1050 

GTIGCTGQGT AGOQCTCACT OOCACXSCTCG OGGGCAQGAA 1GOCAQOGIC 1100 

CXXACTACGA CAATA03AOG CCAOGIOGAC TTGCTOGTIG GGAOQGCIGC 1150 

Tl ' lUlUC ' lO C GCTATGTAQG TGGGGGATCT CIGC3GGATCT ATTITOCTCG 1200 

TCTOOCAGCT GTICAGCTIC TCGQCICGOC GGCA3GAGAC AGTGCAGGAC 1250 

TOCAACIGCT CAA'lL' m Ol' OGQCCATGIA TCBGGTCAOC GCATOQCTTG 1300 

QGATATCATC ATGAACIGGT CACCEACAAC AGCCCTAGTG GIGTOQCAGT 1350 

TOCTCCGGAT CCX2ACAAGCT G1U3IGGAC&, T33TOG03G3 GGCGCACIGG 1400 

GGfiGTDCIQG CQQQOCTTOC CTACEATTOC AIGGTAG33A ACIGQGCTAA 1450 

(jL.TlC'lGA JT G'lUUUUL'lftC TCITIGQ0G3 OGTIGAOGGG GAGAOGCACA 1500 

CGAOGGGGAG GGIGGOOQGC CACACCAOCT OOGQGTICAC GTCGCTITIC 1550 

TCA3CIGGQG OG1C1CAGAA AATCCAGCIT GTGAATAGCA AGGGCAGCIG 1600 

G2ACATCAAC AGGAL'IUUUU TAAATIGCAA TGACICCCIC CAAACIQGGT 1650 

' IL ' m UUOGC GCTGTTTIAC GCACACAAGT TCAACTOGTC OTQSTQCCOG 1700 

GAGQGCATGG OCAGCTG00G COQCATIGAC TGGTTOQOaC AGGQGTQQQG 1750 

O0CCA3CAOC TATACTAAQC CTAACAGCIC GGATCAGAGG CCTEATIQCT 1800 

GGCATEAC3C QOCTOGAOOG TGTQG3GIOG TAOQCGOGTC GCAG3TGTGT 1850 

GGTCCAGIGT ATIGTTICAC COCAAGQ0CT GTIGTGGIGG GGADCACOGA 1900 
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'imriixasr uiuultaogt popqctqqqg ggmsmo&g jogbosiga 1950 

TQL'ILC'I CA A CftACA03D3T OCQ3CACAAG GCftACIGSIT CX33TIUIACA 2000 

T33A.TCAATA GIACIQ33IT CACEAAGAOG TOOQSftGSIC OOOOGIGTAA. 2050 

CA3U33QG33 GTO33T&A0C QCAOCTK»T CTO000CAO3 GftCIGCTTOC 2100 

(33AAGCA0QC 03AG30TACT T&CACAAAAT GIQ3CI03QG G30CIG3TIG 2150 

ACAOCTAG3T GGCTftGiaGA cmOGGAZAC AJULlTlUX' ACT fl Cai C LU 2200 

CACICTCAAT TITIULKIIT GaiGTATOIG G33XOGf iai 2250 

AGCACAQ3CT CAATOCOXA TOCAATIQ3A. CT33A03ftGA GOGCIG'IJAAC 2300 

TIQ3AQ3ACA OLUL'lUL'IGC TOICTACAAC 2350 

AGAGTQ3CAG ATACIGOCCT GIGCTTICAC CA033T&003 ULTl'lMULA 2400 

CTG3TTIGAT OCATCIOCAT CAGAACATCG TOSftOSIQCA AUOl'lUi^C 2450 

GGTEGTAG33T CAGOGTnGT CICXTTTTOCA ATCAAA3G33 AGTftCATOCT 2500 

GTIQCnTIC CTICIDCIG3 CAGAQ30303 O3IGIGIQ0C TOLT1U1UGA 2550 

T3ATOCT3CT GAT7^GCCCAG GCIGAC93XG CUTEAGAGAA CTTOSIOGflC 2600 

CICAA3G333 O3ID0GTOGC OQGAQCX3CAT GUJMI'ICICT (JLTl'lLTlUr 2650 

Gmcncioc oaooacioGr acattaagsg caqqcioxt ocroasaaoG 2700 

CCTATGCTTT TEAT3333TA. TQQQ09CTK9C TOCIGCKTT ACIGOOGTIA 2750 

OCACCACXSAG CTEACGQCTr GGA003QGAG ATO30TQ0AT 031X3003333 2800 

TGCGoTIdT GTA03TCTO3 TATTCTIGftC LT1U1CAOCA TACTACAAAG 2850 

TSTTIXnCAC TftGQCICMA TOGIGSTTAC AAIACnTAT CAOCAGAOOC 2900 

GAQGOGCACA TQCAAGIGTO QGTDQOOOOOC CICAALUl'lC Q33GftQ3103 2950 

O3AT30CATC ATGCICCICA CGIGIQQQGT 1CA3DCAGAG TmATTTTIG 3000 

ACATCAOCAA ACTCCIQCIC GQCATACTOG GOOOQCICAT Q3BXTCCAG 3050 

GCTG3CATAA OGAGAGIQOC GTACTIOSTG CQ03CTCAAG Q30ICATIO3 3100 

T3CAT3CATC TEAGIG03AA AAGIO30033 QOGTCMTRT GTOCAAATO3 3150 

TCTICATCAA QCIG330Q0G CIGACAGGIA CGTftQOTTEA. TAAOCMCIT 3200 

ADOQCACIGC G3GACT3330 OCACG3333C CT»D3AGAOC TIG303IQ3C 3250 

QGTAGAQOOC UlUb'lLTILT 0QG0CA3Q3A GAOCAAG3TC ATCAOCT03S 3300 

GAQCAGACAC OQCIQOGflGT Q33GACATCA 1CTIQ33TCT AG003ICTOC 3350 

GCCCGAAG33 Q3AAQ3AGAT AT3TTIG33A CD330TGATA. GTCICGAAGG 3400 

GCAAQ33TG3 CGACTOCTTC O300CATCAC O30CTACIQC CAACAAA03C 3450 

GG3QQGTACT TOGTIQCftTC ATCACTAGOC TCACAQ300G Q3ACAAGAAC 3500 

CAGGTOGAAG Q33AG3ITCA. AGTGSTTICr AOOGCAACAC AAIL'ITIULT 3550 

GGCX3AOCTQC ATCAAOQQ03 TG7K3C1GGAC TSICEftOCAT 0303CT33CT 3600 

OGAAGACXTT AG0033TOCA AAAQGfTOCAA TCAOOCAAAT GEACADCAAT 3650 

GTAGAQCTO3 AQCICGTDQG CTGQCAQQOG (XO0OO33G G 03CQCTCCAT 3700 

GACAOCAT3C AGCT3IQ3CA QCTGQGAOCT TEACTK33IC A03AGACAT3 3750 

CIGAT3ICAT TCXX3GTGOGC O33QGAQQ0G ACAQCAG333 AAGTCTACIC 3800 
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T00C30CAO3C CEGICIOTA 0CIGAAAG3C TOCI033GfIG GICCATTGCT 3850 

TIQ03CTIO3 GOQCftOGTOG T33333ICIT OT333CTQCT GTC1GCACXX 3900 

0393931030 GAAQ30331G GACJl'lUATAC GCGTIGAGIC TKTQGAAACT 3950 

ACCATO333T CTOCGGICIT CACAGACAAC TCAAOOOXC 033JlUiJACC 4000 

QCJySACATIC CAAGIQ3GAC A ICIGCA CGC 'iUL'iaCIQSC AG3332AAGA 4050 

Q2AOCAAAGT QOCX33CIGD3 TATOCAGOCC AAG3oIACAA G3IGCI03IC 4100 

CIGAACOOST OJbTlUUUGC OOZETAG33 Tl'lUUUULUr AIATCKEAA 4150 

G33ACA033T ATOGAOC3TIA ACAICAGAAC TO333TAAG3 AOCATiaOCA. 4200 

IUJ3CGGCII' CATTAQ3TAC TOCADCTAIG GCAAUl'lULT 1Q00GAO93T 4250 

G3CIGTICIG Q393030CIA TCACATCA3A ATAl'GTCMG A3IQ0CACIC 4300 

AACIGACroS ACMCATCT T333CATO33 CACACIXXTC GAOCAAQQ93 4350 

AGA093CIG3 AGOQ0O33C GTO3TGCI03 OCAUJUL'EAC ACCID0333A 4400 

T333TTACCG T30CACAQ0C CAATAIDoAG GAAAITAQ33C TCIXXAACAA 4450 

TO3AGAGATC UJLT1UUKIG GCAAAQ3CAT GCCA1CAAG3 4500 

Q339SAQ3CA. TCICATJ.T1C TOCCATTCCA AGAAGAAATC TCACGAGCTC 4550 

G003CAAAQC T3ACA030CT 033ACIGAAC QL'lUilAGCAT ATEAQ03333 4600 

OCTIGAJIGTG TCXX3ICATAC O30CIA3O3G AGACGICGTT GTOGIQ9CAA 4650 

CAGACQCTCT AAT3A0333T TICAC03303 ATITIGACIC AGIGAIGGAC 4700 

T3CAATACAT GIGICA0C3CA GACAGIDGAC TICAQCTIOS ATOCXZftOCIT 4750 

CACCATIGAG ACGADGACDG T3QOQCAAGA O3O33T3T0G 03CI03CAAC 4800 

G303AQ3TAG AACIQQCAQG GGTAQ3AGIG G2AIUIACAG GTTIGIGACr 4850 

OCAGGAGAAC 0330X710333 CALLGT1CGAT TCTT333IOC TGTGTGAGTG 4900 

CIATCACQCG OOCIGIOCTT GGTCATCAGCT CACX3QCD3CT GAGAOC1G33 4950 

TTAQ3TTQCG 03CTTACCTA AATACACCAG Q3TIGO003T CIGOCAGGAC 5000 

CATC3GGAGT TCIQ3GAGAG OGTCTICACA. QGCC'ICAOOC ACATAGA3QC 5050 

OCALTILL'IG TOOCAGACIA AACAG3CAGG AGAGAACTIT CCTEACCIGG 5100 

T3GCATATCA AGCTACAGIG T3CX3CCAG33 CICAAQCTOC ALXJ1CCMXX3 5150 

T333AQCAAA 7GHGGAAGIG TCICATACGG CIG7\AAOCTA CACIGCAOGG 5200 

GCCAACAOOC CT3TIGTATA 03CTAQ3AQC OGTOCAAAAT GAG3TCATOC 5250 

1CACACADZC CATAACTAAA TACATCA3Q3 CATO2ALLGTC G3CIGADCIG 5300 

GAQj'ILG'I CA CTAGCAOCIG G3IGCIGGIA G3CGGAGTOC TIGCAGCTIT 5350 

O30O3GATAC T30CIGAOGA CAGGCAGTGT GGTCATIGTG GG2AQGATCA 5400 

'l UTiGlCOGG GAAGOCAGCr UlUbTlUXG ACAGGGAAGT CCKITAQCAG 5450 

GAGTTOGATC AGATGGAAGA G ' lG'lGGL'lCA . CAALTIUL'IT ACATO3AGCA 5500 

Q33AATQCAG CTCGCCGAQC AATICAAGCA AAAGQOGCTC G33TTJ3TIGC 5550 

AAACO30CAC CAAGCAAGOG GAGGCIGCIG CIO0CX3TG3T GGAGTOCftAG 5600 

TGGOGAQQQC TIGAGAOCTT CK333CGAAG CACAIGTO3A. ATITCATCAG 5650 

CX3GAATACAG TAOCIAQCAG OCTTATOCAC TCIGOCIGGA. AAGOOGOOGA 5700 
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TAGCATCAIT GA3Q3GATTT ACAQCTICIA TCACTMOCC GdCAOCAOC 5750 

CftAAACftOOC lUL'IUl ' lTA A CftTCITG333 Q3A1Q35IQS CIQCOCftftCT 5800 

OG L ' IUL ' IOX ' ABOXIQCOT CAGCTTTOGr Q33O30CQGC MOT093fiG 5850 

CGGL'lUl'IGG CAQCATAQ3C CTTO33AAGG TXTOGIQSA CAICTT33CE 5900 

G3CIKD339S CM3333I2X3C CXSGOSCaCIC WlUUULTriA AGGflCftTCRG 5950 

GGGOGAGGIG CXTTOCAOOG AGjAJJL'lUbT CAACTIACTC UCIUJUATOC 6000 

'lU l UlGlIGG 1GOC3CIG3IC GTD3GGGTO3 TCTOCGCBGC AATTOODGT 6050 

OGQCauaiGS QCn33G GA GA, G3333CIGIG CAGTO3ATCA AC033ZTCAT 6100 

AQUb Tl LULT 1D3CX3393m ACXaQSICIC QCETaOSCAC TKIUIULUIG 6150 

AGAQCGAOGC TOCAGZA03T (3TCACTCAGA. 'lUL'lL'lL'HAG OCTDUXKIC 6200 

ACICAACIQC 1GAAG333CT OCftOCAGIQS ATIAATSAQS ACIQCTCEftC 6250 

GCCKFQCTQC G3CIOGIGGC TAAGGGAT3T TK33GATIQ3 ATATOGAOSG 6300 

T3TIGACIGA. CITCAAGACC TGGCIOCAGT CCAAACIDCT UULUUUbTIA 6350 

COQQGAGIDC L ' lTl L XJlUl L' ATOOCAAQQC G3GTACAAGG GAGIUIT3GCG 6400 

QQGQ3AQQSC ATCA3QCAAA OCACXTTOOQC ATtX3GGAQCA CAGATO3QQG 6450 

GACft lUICA ft. AAADOGTIDC ATGAGGAIQG TAG330CEAG AftOCIQCAGC 6500 

AACAQGJIQ3C ACX33AAOG3T CCOCATCAAC GCAIACACCA 0033AULT1U 6550 

CACAOCCTCC OCX330QCOCA. ACTATICCAG G303CEAIQ3 GGQGTOGCIG 6600 

CT3AQGAGIA CX3IQ3AG3IT ACQQGIGIQ3 GQGftli'l'lULA CIBGGTCADG 6650 

G3CATCAQCA CTGACAAQGT AAAGIQQOCA T3CCAGGTIC O33QQ000GA 6700 

AT1LT1CAQG GAGGflQGATG GAGTK3CX33IT GCACAGGTCftC QCICO3G0GT 6750 

GCAAAOCICT TCTADQQGftG GAOGTCACGT TOCAQSKX33 QCTCAAOCAA 6800 

TACTIQGTTQG GGTCQCAQCT OOCATQOGAG OCCGAAQOGG AOGTAACAGT 6850 

GCITftCTICC ATQCTCAOOG ATCQCIOXA CATTACAQCA GftGACG3CTA 6900 

AQCX3TAG3CT OQCTAGAGGG ICIOCQOQCT CITEAGC3CAG CTCA3CRQCT 6950 

AGOCAGITCT CIQOQC3CTIC TTIGAAQQOG ACATOCACTA CGZAOCATCA 7000 

CTOOOOGGftC OTGftOCICA TCGAG3CCAA COCTIGIQ3 OSGCAGGAGA 7050 

1GQQ0QGAAA CA3CACIOGC GIQ3AGICAG AGAAIAAGGT AGIAATICIG 7100 

GACICTTIOG AACUUL'l'lLA OQOQGAGQQG GATCAGAGGG AGA3MOOGT 7150 

OXX3QQGGAG ATQCIGCGAA AATCCAG3AA GTICGCCICA G0GTIQ00CA 7200 

TftTGGGCftOG OGCQGACIAC AATCCIOCAC TGCEAGAGTC CTQGAAGGftC 7250 

OCQGACC^G U UJLlUUUtjr GGTACAQ3GA TOQCCATIOC CAOCTAQCAA 7300 

GQCICCroCA ATACCAOCIC CAQ3GAGAAA GAGGAGGGTT GTOCIGACAG 7350 

AATCCAATGT GrCTICTOOC TIGGCGGAGC TOXCACIAA GAOCTKX3GT 7400 

AQCTOZGGAT OTICGGOCGT TGATAQQ93C ACUXGAODG OOCTTOCTGA 7450 

OJIGGCeiCC GACGAOSGT3 ACAAAGGATC 0GAQ3TIGAG TOGfTftCTOCT 7500 

OCAT3Q000C OCTIGAAQGG GAQCCG3GGG ACOCTGAIUT CAGCGACQGG 7550 

TCTTGGICIA OCX3IGAGIGA GGAGGCTAGT GAGGAIUI03 TCIQCTGCIC 7600 
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AATGTOCEAT AOSIGSACAG GCGOCOGAT CA033GAIGC GCIG033AGG 7650 

AAAGTAAGCT GOOCATCAAC CX3GTIGAGCA. ACTCITIQCT GQ3ICACCAC 7700 

AACATOGICT AOQOCACAAC AIOOOGCAGC GCAAGUCIOC G3CAGAAGAA 7750 

Q3ICADCTIT GACAGATIGC AAGTOCIGGA TGA11UA11TAC GG33ACI7IAC 7800 

TCAAQGRGAT GAAQ3CGAAG GCGIUCACAG TIAAG3CrAA GCTICEATCT 7850 

AaBGAGGAGG OCIQCAAGCT GA0330XCA CATTCGSCCA AATOCAAATT 7900 

1Q3CEA3Q33 QCAAAGGAGG TOCX3GAAOCT ASOCAGCAGG GOGUITAAGC 7950 

ACAT00GCIC OGIGIGGGAG GALT1UL, , .LU3 AAGACACTGA AACAOCAATr 8000 

GACAOCAOCA TCATOQCAAA AAGTGAG3TT T1U1ULU1UC AAGCAGAGAA 8050 

G3GAGGGCGC AAG0CAGCIC GCCITAIDoT ATIOOCAGAC CTX333AGTIC 8100 

G'lUlMGOGA GAAGATOGOC CTITAOGAOG 'lUi'ICTCCAC OLTIUL'ICAG 8150 

GQ0GTGAT33 GC'ICL'ICATA OGGATTICAA TACICOOOZA. AQCAGQGGGT 8200 

a3AGTTOCIG GIGAATAOCT GGAAATCAAA GAAA3GOOCT AIGGGCITCT 8250 

CAHMGACAC G0QCT3ITIT GACICAAD33 TCACIGAGAG T3ACATICGT 8300 

GTIGAGGAGT CAATTIAGCA A3GTIGIGAC TIQGQ00003 AGG0CAGACA 8350 

GG0CA3AAGG TCQCICACAG AQ033CITEA CA3G333GGT GOOTGACIA. 8400 

ACICAAAAGG GCAGAACIGC GGTTATCGOC GGIG00303C AAGTGGOGTG 8450 

CIGAOGACTA GL'lUUUb'lAA TAOOCICACA 1U1TALT1UA AGGOGACIGC 8500 

AGQCIGTQGA GCIGCAAAGC TCCAGGACIG CAOGA3GCIC GTGAAGQGAG 8550 

AOGAQCTIGT (JbT l M C lGl 1 GAAAGCGCGS GAAOOCAGGA OGATOOGGCG 8600 

GCCCEAOGAG GCTTCACGGA QQCIA3GACT AGG1ATIOOG OO0QO00OGG 8650 

GGAT0QQQ0C CAAOCAGAAT AOGALC'IGGA GdGATAACA. TCA3GTTOCT 8700 

OCAA3GIGTC AGIOQQQCAC GAUGCATCIG GGAAAAQQGT A3AC3AGCIC 8750 

ACOOGIGACC CCAOCAGOOC OCTIGCAGGG GCK3CGT333 AGACAQCTAG 8800 

ACACACieCA ATCAACICIT GGCIAGGCAA TATCATCATC TATGG3CDCA 8850 

OQCIATGGGC AAGGAIGATT CTGAIGACIC AL'ITITIUIC CKiULTlLTA 8900 

GCICAAGAGC AACTTGAAAA AGOOCIGGAT TGICAGATCT AOGQGQCnG 8950 

CTACIOCATr GAGOCACTIG ACCTACCTZA GATCATIGAA. OGACK3CA3G 9000 

GTCTTAQOQC ATITACACTC CACAGfTEACT CIDCAG3TGA. GATCAATAGG 9050 

GTQQCTICAT QQCICAGGAA ACTIGQGGTA OCAO0CTIGC GAAOCIQGAG 9100 

ACATCQQGOC AGAAGTGTGC GOQCTAAQCT AL'IGIUXAG G3333 GAG3G 9150 

OCX3CCACnG TQQCAGATAC CICTTEAACr GQGCAGTAAG GAOCAAGCIT 9200 

AAACICACIC CAATC00QGC CQ0GIO0CAG CIGGACTIGT CIGGCIGGTT 9250 

OJIUUCIGG T TACAGOQQQG GAGACATATA TCACAGOCIG TCKGIGOOC 9300 

gacoooqcig umumrm tocceacicc tactticigt AQQGGEAGQC 9350 

ATTEACCIGC TQ00CAAO0G ATGAACQGGG AGCIAAGCAC TOCAGQOCIT 9400 

aagqcattic LMTrmT ' iTiTii ' rx ' iT TrnrnnT iitititi'it 9450 

' ITldTlULT T1U.TH-T1T TiTlULTl ' lC TITIT32CTT CITEAAtlGGT 9500 
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MSINFKPQRK 1KRNINRRPQ EVKFFG332I V33VYLLPRR GPRLGVRATR 50 

KASERSQPH3 EPQPIEKRRR PB3RAWAQPG YPWPLYGN33 D3WW3WLISP 100 

R3SRPSWGPT DPRRKSRNLG KVTjmiTOGF AOJCYIELV GftELGSAftRA 150 

LAH3VRVLED GVNXKFGNLP QCSFSIFLIA LLSCLTTPAS AYEVENV93I 200 

YH7INDCSNS SIWEAAEWI MHTPQCVPCV QEGNSSRCW AETPTIAARN 250 

AS VKTITJLKK HVE6LIM3TAA FCSAMY\K3X CJSSIELiVSQL. FIFSPKRHET 300 

VQXNCSIYP GHVSGHRMAW EM^JWSPTT ALWSQLIBI PQAWHWRG 350 

AHW3VLAGLA YYSMW3MAK VLIVALLEAG VOGEIHTIGR VAGftnSGPT 400 

SLFS9GASQK IQLVNIN3SW HINKIAIIOJ DSLQIGFEfcA IFiffiHKFNSS 450 

QCFERMASCR PHWFftQGWG PITYIKPNSS DQRPYOraiA PRPOSWPAS 500 

QVCGPVYCFT PSPVWGTID RS3VPIYSWG ENEID7MLLN NIREPQGNWF 550 

GCIVMNSIGF TKTOQ3PPCN IGGVGNKIIJ: CPTCCFEKHP EAIYIKD393 600 

PWLTPKXVD YPYELtfHYPC TXNreiFKVR MYVG3VEHRL NftACNWIBGE 650 

PCNLEERERS ELSPLLLSTT EWQILPCAFT TLPALSTP3LI HLH3NIVCM3 700 

YLYGVGSAFV SFAIKWEYTX LLETJLLAEftR VCAOJAM^LL IAQAEAALEN 750 

LWLNAASVA GAH3ILSFLV FFOVWYTKG FiAPGAAYAF YOJWPLLLLL, 800 

LALPPRAYAL EREMAA9CGG AVLV3LVFLT LSPYYKVFLT RLIVMLQYFI 850 

IRAEAHM^W VPPLNVRGGR DAIILLTCfiV HPELIFDITK IiLAIDSPLM 900 

VLQAGTIKVP YFVFAQGLIR ACMLVRKUAG OiYVQMVPMK D3ALTCIYVY 950 

NHLTPLREWA. HAGLFIXAVA VEPWFSAME 1KVTIW3ADT AAD3DIID3L 1000 

PVSARFO^EI FLGPADSLEE QGWRLLAPIT AYSQQIEGVL QCIIISL'XGR 1050 

EKNQVEGEVQ WSlftlQSEL ATCENGVCWT VYH3AGSKTL AGPKGPITQM 1100 

YINVEOXVG WQAPPGARSM TPCSOGSSDL YLVI^HAEVI PVRRPGDSRG 1150 

SLLSPRPVSY LKGSSQGPLL CPSGHWGVF FAAVCIR3VA KAVuf lWES 1200 

METIMRSPVF UNSTPPAVP QTIT^/AHLHA PIG9GKSIKV PAAYAAQGYK 1250 

VLVLNPSVAA •nJGFGAYMSK AHGIDPNTRT GVKITl'lUGS TIYSTXGKFI, 1300 

ADGGC9GGAY DIIICDECHS IDSTITLGIG IVLDQAEEAG ARLWIATOT 1350 

PPGSVIVPHP HEEE2GLSNN GEEPFYGKAI PHAIKGGRH IJPCHSKKKC 1400 

EELAAKLTGL GLNAVAYYRG LEMSVIPPIG EVWVA3TAL MIGFIGEEDS 1450 

VliXNKVIQ TVEFSLDPTF TIEITIV PCP AVSRSQRRGR TGR3RSGKR 1500 

FVTPGERPSG MFDSSVLCEE YDAGCAWYEL TPAEISVRLR AXLNTOGLPV 1550 

OQEHLEFWES VFTGLTHTCA HFLSQIKQftG ENFPYLVBXQ A3VCARAQAP 1600 

PPSWDtJWKC LIRLKPTLH3 PTPLLYRLGA V^sEVTLTHP riKJOMftCMS 1650 

ADLEWTSIW VLVQGVLAAL AAYCLTIGSV V3V3RIHSG KPAWPEPEV 1700 

LYQEFDEMEE CA9QLPYIEQ GM^LAEQFKQ KATfTTQIAT KQAEAAAPW 1750 

ESKWRALEIF WAKHMaNFTS GIQYLAGLST LPGNPAIASL MAFTASTTSP 1800 

LTTQNIUfN ILGGWVAAQL APPSAASAFV GAGIAGAAV3 SIGLGKVLVD 1850 

3XAGYGAGVA GALVAFKVMS GEVPSTEDLV NLLPAILSFG ALVVGWCAA 1900 
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PRLPGVPFLS O^RGYKGVWR GDGB4QITCP OGBQIAGHVK N324RIVGPR 
TCaiIV«3IF PINAYTCGFC TPSPAHWSR AIWRHAAEEY VEVXRM3EFH 
WIGMTIENV KCPCQVPAEE FFIEVDGWRL HRYAPACKPL Ii*ED/nW3 
LN2YLU39X PCEPEPEVIV LTSMLTDPSH 1TALTAKRRL ARGSPPSLAS 
SSASQLSAPS IKATCTIHHD SPDftTT.TTON I1HRQEM3GN HKVESENKV 
VELDSFEPIH A&ETJJRKSRK FPSALPIV^R PDiNPPIiES 

WKDPDYVPPV VHXHjPPIK APPIEPPKRK SSAIAELATK 
TF3SS3SSAV DSGTEKEALfD LASCCGEKGS EVESYS£MPP LEGEPGDPEL 
SDGSWSIVSE EASETWCCS MSYIWIGALI TPCAAEESKL PINPI^NSLL 
RHHMWYATT SRSASL«2KK VTFIFLQJLD EHZREWLKEM KAKASTVKAK 
LLSIEEACKL TPPHSAKSKF GYGAKEMRNL SSRAVNHIRS VWEH1S5EE 
TPIDrnMAK SEWPCM3EEK GGRKPARLIV FPELGVRSAZE KMALYTWST 
LPQAVM3SSY GPQ*SPKQRV EFIiVNIWKSK KCEM3FSYDT PCFDSIVIES 
DZRVEESIYQ CCIXAPE?^RQ AIRSLTEKLY IGGPIUNSKS O^DGYRFCRA 
SGWLTISO^I TLTCYLKMA ACRAAKDQDC TMLVN3XLV VICESAGIQE 
EftAALRAFIE AMIKYSAPPG DPPQPEWXE LTISCSSNVS VAHDfcSGKRV 
YYLTRDPTTP LARAAWEIAR HIPINSWLGN IIMZAPTIJWA. PMUMMFFS 
HjLAQEQLEX ALDQQIYGAC YSIEPLEfcFQ IIE231H3LSA FIIHSYSPGE 
INRVASCLRK IJ3VPPLKIV2R HRARSVRAKL LSQQ3RAATC GRiUNWAVR 
TKLKLTPIPA A9QTJLSGWF VAG¥S3GDIY HSLSRARPRW FPITLLLLSV 
GM3IYLLFNR 
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Emerson, Suzanne 
Bukh, Jens 
Purcell, Robert 

<120> Cloned Genome of Infectious Hepatitis C Viruses of 
Genotype 2a and Uses Thereof 

<130> 20264302PC 

<140> TBA 

<141> 2000-06-02 

<150> 60/137,693 
<151> 1999-06-04 

<160> 39 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 9711 
<212> DNA 

<213> Hepatitis C virus 
<400> 1 

acccgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 
cttcacgcag aaagcgtcta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 12 0 
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 18 0 
aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 24 0 
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 3 00 
tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 3 60 
tcaaagaaaa accaaaagaa acaccaaccg tcgcccacaa gacgttaagt ttccgggcgg 42 0 
cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 4 80 
cgcgacaagg aagacttcgg agcggtccca gccacgtgga aggcgccagc ccatccctaa 540 
agatcggcgc tccactggca aatcctgggg aaaaccagga tacccctggc ccctatacgg 60 0 
gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggttccc gtccctcttg 660 
gggccccaat gacccccggc ataggtcgcg caacgtgggt aaggtcatcg ataccctaac 72 0 
gtgcggcttt gccgacctca tggggtacat ccctgtcgtg ggcgccccgc tcggcggcgt 7 80 
cgccagagct ctcgcgcatg gcgtgagagt cctggaggac ggggttaatt ttgcaacagg 840 
gaacttaccc ggttgctcct tttctatctt cttgctggcc ctgctgtcct gcatcaccac 900 
cccggtctcc gctgccgaag tgaagaacat cagtaccggc tacatggtga ctaacgactg 960 
caccaatgac agcattacct ggcagctcca ggctgctgtc ctccacgtcc ccgggtgcgt 102 0 
cccgtgcgag aaagtgggga atgcatctca gtgctggata ccggtctcac cgaatgtggc 1080 
cgtgcagcgg cccggcgccc tcacgcaggg cttgcggacg cacatcgaca tggttgtgat 114 0 
gtccgccacg ctctgctctg ccctctacgt gggggacctc tgcggtgggg tgatgctcgc 12 00 
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agcccaaatg ttcattgtct cgccgcagca ccactggttt gtccaagact gcaattgctc 1260 

catctaccct ggtaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 132 0 

gcccacggct accatgatct tggcgtacgc gatgcgtgtc cccgaggtca ttatagacat 13 80 

cattagcggg gctcattggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440 

SJtggQcgaaa gtcgttgtca tccttctgtt ggccgccggg gtggacgcgc gcacccatac 1500 

tgttgggggt tctgccgcgc agaccaccgg gcgcctcacc agcttatttg acatgggccc 1560 

caggcagaaa atccagctcg ttaacaccaa tggcagctgg cacatcaacc gcaccgccct 1620 

gaactgcaat gactccttgc acaccggctt tatcgcgtct ctgttctaca cccacagctt 1680 

caactcgtca ggatgtcccg aacgcatgtc cgcctgccgc agtatcgagg ccttccgggt 174 0 

gggatggggc gccttgcaat atgaggataa tgtcaccaat ccagaggata tgagacccta 1800 

ttgctggcac tacccaccaa ggcagtgtgg cgtggtctcc gcgaagactg tgtgtggccc 1860 

agtgtactgt ttcaccccca gcccagtggt agtgggcacg accgacaggc ttggagcgcc 192 0 

cacttacacg tggggggaga atgagacaga tgtcttccta ttgaacagca ctcgaccacc 1980 

gctggggtca tggttcggct gcacgtggat gaactcttct ggctacacca agacttgcgg 204 0 

cgcaccaccc tgccgtacta gagctgactt caacgccagc acggacctgt tgtgccccac 2100 

ggactgtttt aggaagcatc ctgataccac ttacctcaaa tgcggctctg ggccctggct 2160 

cacgccaagg tgcctgatcg actaccccta caggctctgg cattacccct gcacagttaa 222 0 

ctataccatc ttcaaaataa ggatgtatgt gggaggggtt gagcacaggc tcacggctgc 22 80 

atgcaatttc actcgtgggg atcgttgcaa cttggaggac agagacagaa gtcaactgtc 2340 

tcctttgttg cactccacca cggaatgggc cattttacct tgctcttact cggacctgcc 24 00 

cgccttgtcg actggtcttc tccacctcca ccaaaacatc gtggacgtac aattcatgta 2460 

tggcctatca cctgccctca caaaatacat cgtccgatgg gagtgggtaa tactcttatt 2520 

cctgctctta gcggacgcca gggtttgcgc ctgcttatgg atgctcatct tgttgggcca 2580 

ggccgaagca gcactagaga agctggtcat cttgcacgct gcgagcgcag ctagctgcaa 2640 

tggcttccta tattttgtca tctttttcgt ggctgcttgg tacatcaagg gtcgggtagt 2700 

ccccttagct acctattccc tcactggcct gtggtccttt agcctactgc tcctagcatt 2760 

gccccaacag gcttatgctt atgacgcatc tgtgcatggc cagataggag cggctctgct 2820 

ggtaatgatc actctcttta ctctcacccc cgggtataag acccttctca gccggttttt 2880 

gtggtggttg tgctatcttc tgaccctggg ggaagctatg gtccaggagt gggcaccacc 2 94 0 

tatgcaggtg cgcggtggcc gtgatggcat catatgggcc gtcgccatat tctacccagg 3000 

tgtggtgttt gacataacca agtggctctt ggcggtgctt gggcctgctt acctcctaaa 3060 

aggtgctttg acgcgcgtgc cgtacttcgt cagggctcac gctctactga ggatgtgcac 312 0 

catggcaagg catctcgcgg ggggcaggta cgtccagatg gcgctactag cccttggcag 3180 

gtggactggc acttacatct atgaccacct cacccctatg tcggattggg ctgctagtgg 3240 

cctgcgggac ctggcggtcg ccgttgagcc tatcatcttc agtccgatgg agaagaaagt 33 00 

cattgtctgg ggagcggaga cagctgcttg tggggacatt ttacacggac ttcccgtgtc 33 60 

cgcccgactt ggtcgggagg tcctccttgg cccagctgat ggctatacct ccaaggggtg 3420 

gagtcttctc gcccccatca ctgcttacgc ccagcagaca cgtggccttt tgggcaccat 34 80 

agtggtgagc atgacggggc gcgacaagac agaacaggct ggggaaattc aggtcctgtc 354 0 

cacagtcact cagtccttcc tcggaacatc catctcgggg gttttgtgga ctgtctacca 3600 

tggagctggc aacaagactc tggccggctc acggggtccg gtcacgcaga tgtactccag 3660 

tgctgagggg gacttagtag ggtggcccag cccccctggg actaaatctt tggagccgtg 3720 

cacgtgtgga gcggtcgacc tgtacctggt cacgcggaac gctgatgtca tcccggctcg 3780 

aagacgcggg gacaaacggg gagcgctact ctccccgaga cctctttcca ccttgaaggg 3840 

gtcctcagga ggcccggtgc tatgccccag gggccacgct gtcggagtct tccgggcagc 3900 

tgtgtgctct cggggcgtgg ctaagtccat agatttcatc cccgttgaga cactcgacat 3960 

cgtcacgcgg tcccccacct ttagtgacaa cagcacacca cctgctgtgc cccagaccta 4020 

tcaggtcggg tacttgcatg ccccgactgg cagtggaaag agcaccaaag ttcctgtcgc 4080 
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atatgctgct caggggtata aagtgctagt gcttaatccc tcagtggctg ccaccctggg 4140 
gtttggggcg tacttgtcta aggcacatgg catcaatccc aacattagga ctggagtcag 4200 
gactgtgacg accggggcgc ccatcacgta ctccacatat ggcaaattcc tcgccgatgg 4260 
gggctgtgcg ggcggcgcct acgacatcat catatgtgat gaatgccatg ccgtggactc 4 32 0 
taccaccatc cttggcatcg gaacagtcct tgatcaagca gagacagctg gggtcagact 43 80 
aactgtgctg gctacagcta cgccccctgg gtcagtgaca accccccacc ccaacataga 444 0 
99 a 99tggcc cttgggcagg agggcgagat ccccttctat gggagggcga ttcccctgtc 4500 
ttacatcaag ggaggaagac atctgatctt ctgccattca aagaaaaagt gtgacgagct 4560 
cgcggcggcc cttcggggta tgggcttgaa ctcagtggca tactacagag ggttggacgt 4 62 0 
ctccgtaata ccaactcagg gagacgtagt ggtcgtcgcc accgacgccc tcatgacagg 4680 
gtatactggg gactttgact ccgtgatcga ctgcaacgta gcggtcactc aagttgtaga 474 0 
cttcagttta gaccccacat tcaccataac cacacagatt gtccctcaag acgctgtctc 4 80O 
acgtagccag cgccggggtc gcacgggtag gggaagactg^ ggcatttata ggtatgtttc 4 860 
cactggtgag cgagcctcag gaatgtttga cagtgtagtg ctctgtgagt gctacgacgc 4 92 0 
aggggccgca tggtatgagc tcacaccatc ggagaccacc gtcaggctca gggcgtattt 4 980 
caacacgccc ggtttgcctg tgtgccaaga ccatcttgag ttttgggagg cagttttcac 5040 
cggcctcaca cacatagatg cccacttcct ttcccaaaca aagcaatcgg gggaaaattt 5100 
cgcatactta acagcctacc aggctacagt gtgcgctagg gccaaagccc cccccccgtc 5160 
ctgggacgtc atgtggaagt gtttgactcg actcaagccc acactcgtgg gccccacacc 5220 
tctcctgtac cgcttgggct ctgttaccaa cgaggtcacc ctcacacatc ccgtgacgaa 52 80 
atacatcgcc acctgcatgc aagccgacct tgaggtcatg accagcacat gggtcttggc 5340 
agggggagtc ttggcggccg tcgccgcgta ttgcctggcg accgggtgtg tttgcatcat 54 00 
cggccgcttg cacattaacc agcgagccgt cgttgcgccg gacaaggagg tcctctatga 54 60 
ggcttttgat gagatggagg aatgtgcctc tagggcggct ctcattgaag aggggcagcg 5520 
gatagccgag atgctgaagt ccaagatcca aggcttattg cagcaagctt ccaaacaagc 5580 
tcaagacata caacccactg tgcaggcttc atggcccaag gtagaacaat tctgggccaa 5640 
acacatgtgg aacttcatta gcggcatcca atacctcgca ggactatcaa cactgccagg 5700 
gaaccctgca gtagcttcca tgatggcgtt cagtgccgcc ctcaccagtc cgctgtcaac 5760 
aagcaccact atccttctca acattttggg gggctggcta gcatcccaaa ttgcaccacc 5820 
cgcgggggcc actggcttcg ttgtcagtgg cctagtggga gctgccgtag gcagtatagg 58 80 
cttaggtaag gtgctagtgg acatcctggc agggtatggt gcgggcattt cgggggctct 5940 
cgtcgcattc aagatcatgt ctggcgagaa gccctccatg gaggatgtcg tcaacttgct 6000 
gcctggaatt ctgtctccgg gtgccttggt agtgggagtc atctgcgcgg ccattctgcg 6060 
ccgacacgtg ggaccggggg aaggcgccgt ccaatggatg aatagactca ttgcctttgc 6120 
ttccagagga aatcacgtcg cccccaccca ctacgtgacg gagtcggatg cgtcgcagcg 6180 
tgtgacccaa ctacttggct cccttaccat aaccagcctg ctcagaagac tccacaactg 6240 
gattactgag gactgcccca tcccatgcgg cggctcgtgg ctccgcgatg tgtgggactg 6300 
ggtttgcacc atcctaacag actttaaaaa ttggctgacc tccaaattat tcccaaagat 6360 
gcccggcctc ccctttgtct cctgtcaaaa ggggtacaag ggcgtgtggg ccggcactgg 6420 
catcatgacc acacggtgtc cttgcggcgc caatatctct ggcaatgtcc gcttgggctc 64 80 
catgagaatc acggggccta agacctgcat gaatatctgg caggggacct ttcctatcaa 6540 
ttgttacacg gagggccagt gcgtgccgaa acccgcgcca aactttaagg tcgccatctg 6600 
gagggtggcg gcctcagagt acgcggaggt gacgcagcac gggtcatacc actacataac 6660 
aggactcacc actgataact tgaaagtccc ctgccaacta ccctctcccg agttcttttc 6720 
ctgggtggac ggagtgcaga tccataggtt tgcccccaca ccgaagccgt ttttccggga 6780 
tgaggtctcg ttctgcgttg ggcttaattc atttgtcgtc gggtcccagc ttccttgcga 6840 
ccctgaaccc gacacagacg tattgatgtc catgctaaca gatccatctc atatcacggc 6900 
ggagactgca gcgcggcgtt tagcgcgggg gtcaccccca tccgaggcaa gctcctcggc 6960 
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gagccagcta tcggcaccat cgctgcgagc cacctgcacc acccacggca aagcctatga 7 02 0 
tgtggacatg gtggatgcta acctgttcat ggggggcgat gtgactcgga tagagtctgg 7080 
gtccaaagtg gtcgttctgg actctctcga cccaatggtc gaagaaagga gcgaccttga 7140 
gccttcgata ccatcagaat acatgctccc caagaagagg ttcccaccag ctttaccggc 7200 
ctgggcacgg cctgattaca acccaccgct tgtggaatcg tggaaaaggc cagattacca 7260 
accggccact gttgcgggct gtgctctccc tcctcctagg aaaaccccga cgcctccccc 7320 
aaggaggcgc cggacagtgg gcctaagtga ggactccata ggagatgccc ttcaacagct 7380 
ggccattaag tcctttggcc agcccccccc aagcggcgat tcaggccttt ccacgggggc 7440 
gggcgctgcc gattccggca gtcagacgcc tcctgatgag ttggcccttt cggagacagg 7500 
ttccatctct tccatgcccc ccctcgaggg ggagcttgga gatccagacc tggagcctga 7560 
gcaggtagag ccccaacccc ccccccaggg gggggtggca gctcccggct cggactcggg 7620 
gtcctggtct acttgctccg aggaggacga ctccgtcgtg tgctgctcca tgtcatactc 7680 
ctggaccggg gctctaataa ctccttgtag tcccgaagag^ gagaagttac cgattaaccc 7740 
cttgagcaac tccctgttgc gatatcacaa caaggtgtac tgtaccacaa caaagagcgc 7800 
ctcactaagg gctaaaaagg taacttttga taggatgcaa gtgctcgact cctactacga 7860 
ctcagtctta aaggacatta agctagcggc ctccaaggtc accgcaaggc tcctcaccat 7920 
ggaggaggct tgccagttaa ccccacccca ttctgcaaga tctaaatatg ggtttggggc 7980 
taaggaggtc cgcagcttgt ccgggagggc cgttaaccac atcaagtccg tgtggaagga 804 0 
cctcctggag gactcagaaa caccaattcc cacaaccatt atggccaaaa atgaggtgtt 8100 
ctgcgtggac cccaccaagg ggggcaagaa agcagctcgc cttatcgttt accctgacct 8160 
cggcgtcagg gtctgcgaga agatggccct ttatgacatt acacaaaaac ttcctcaggc 8220 
ggtgatgggg gcttcttatg gattccagta ttcccccgct cagcgggtag agtttctctt 8280 
gaaagcatgg gcggaaaaga aggaccctat gggtttttcg tatgataccc gatgctttga 8340 
ctcaaccgtc actgagagag acatcaggac tgaggagtcc atatatcggg cctgctcctt 8400 
gcccgaggag gcccacactg ccatacactc gctaactgag agactttacg tgggagggcc 8460 
tatgttcaac agcaagggcc aaacctgcgg gtacaggcgt tgccgcgcca gcggggtgct 8520 
caccactagc atggggaaca ccatcacatg ctacgtgaaa gccttagcgg cttgtaaagc 8580 
tgcagggata atcgcgccca caatgctggt atgcggcgat gacttggttg tcatctcaga 8640 
aagccagggg accgaggagg acgagcggaa cctgagagcc ttcacggagg ctatgaccag 8700 
gtattctgcc cctcctggtg acccccccag accggagtat gatctggagc tgataacatc 8760 
ttgctcctca aatgtgtctg tggcgctggg cccacaaggc cgccgcagat actacctgac 8820 
cagagaccct accactccaa tcgcccgggc tgcctgggaa acagttagac actcccctgt 8880 
caattcatgg ctgggaaaca tcatccagta cgccccgacc atatgggctc gcatggtcct 8940 
gatgacacac ttcttctcca ttctcatggc tcaagacacg ctggaccaga acctcaactt 9000 
tgagatgtac ggagcggtgt actccgtgag tcccttggac ctcccagcta taattgaaag 9060 
gttacatggg cttgacgctt tttctctgca cacatacact ccccacgaac tgacacgggt 9120 
ggcttcagcc ctcagaaaac ttggggcgcc acccctcaga gcgtggaaga gccgggcacg 9180 
tgcagtcagg gcgtccctca tctcccgtgg ggggagagcg gccgtttgcg gtcgatatct 9240 
cttcaattgg gcggtgaaga ccaagctcaa actcactcca ttgccggaag cgcgcctcct 9300 
ggatttatcc agctggttca ccgtcggcgc cggcgggggc gacatttatc acagcgtgtc 9360 
gcgtgcccga ccccgcttat tgctctttgg cctactccta ctttttgtag gggtaggcct 9420 
tttcctactc cccgctcggt agagcggcac acattagcta cactccatag ctaactgtcc 9480 
cttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 9540 
tttttttttt tttttttttt tttttctttt tttctctttt ccttctttct taccttattt 9600 
tactttcttt cctggtggct ccatcttagc cctagtcacg gctagctgtg aaaggtccgt 9660 
gagccgcatg actgcagaga gtgccgtaac tggtctctct gcagatcatg t 9711 
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<210> 2 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 
<400> 2 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Ar^r Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

lie Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val lie Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Val Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys lie Thr Thr Pro Val Ser Ala Ala 
180 185 190 

Glu Val Lys Asn lie Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr 
195 200 205 

Asn Asp Ser lie Thr Trp Gin Leu Gin Ala Ala Val Leu His Val Pro 
210 215 220 
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Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gin Cys Trp lie 
225 230 235 240 

Pro Val Ser Pro Asn Val Ala Val Gin Arg Pro Gly Ala Leu Thr Gin 
245 250 255 

Gly Leu Arg Thr His lie Asp Met Val Val Met Ser Ala Thr Leu Cys 
260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 
275 280 285 

Gin Met Phe lie Val Ser Pro Gin His His Trp Phe Val Gin Asp Cys 
290 295 300 

Asn Cys Ser lie Tyr Pro Gly Thr lie Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met lie Leu Ala Tyr 
325 330 335 

Ala Met Arg Val Pro Glu Val lie lie Asp lie lie Ser Gly Ala His 
340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp 
355 360 365 

Ala Lys Val Val Val lie Leu Leu Leu Ala Ala Gly Val Asp Ala Arg 
370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gin Thr Thr Gly Arg Leu Thr 
385 390 395 400 

Ser Leu Phe Asp Met Gly Pro Arg Gin Lys lie Gin Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu His Thr Gly Phe lie Ala Ser Leu Phe Tyr Thr His Ser Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser lie Glu Ala 
450 455 460 

Phe Arg Val Gly Trp Gly Ala Leu Gin Tyr Glu Asp Asn Val Thr Asn 
465 470 475 480 
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Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gin Cys 
485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys THr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 
595 600 605 

Pro Arg Cys Leu lie Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
610 615 620 

Thr Val Asn Tyr Thr lie Phe Lys He Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 
645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gin Leu Ser Pro Leu Leu His Ser 
660 665 670 

Thr Thr Glu Trp Ala He Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala 
675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gin Asn He Val Asp Val Gin 
690 695 700 

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr He Val Arg Trp 
7 05 710 715 720 

Glu Trp Val He Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
725 730 735 
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Ala Cys Leu Trp Met Leu lie Leu Leu Gly Gin Ala Glu Ala Ala Leu 
740 745 750 

Glu Lys Leu Val lie Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly 
755 760 765 

Phe Leu Tyr Phe Val lie Phe Phe Val Ala Ala Trp Tyr lie Lys Gly 
770 775 780 

Arg Val Val Pro Leu Ala Thr Tyr Ser Leu Thr Gly Leu Trp Ser Phe 
785 790 795 800 

Ser Leu Leu Leu Leu Ala Leu Pro Gin Gin Alar Tyr Ala Tyr Asp Ala 
805 810 815 

Ser Val His Gly Gin lie Gly Ala Ala Leu Leu Val Met lie Thr Leu 
820 825 830 

Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Ser Arg Phe Leu Trp 
835 840 845 

Trp Leu Cys Tyr Leu Leu Thr Leu Gly Glu Ala Met Val Gin Glu Trp 
850 855 860 

Ala Pro Pro Met Gin Val Arg Gly Gly Arg Asp Gly lie lie Trp Ala 
865 870 875 880 

Val Ala lie Phe Tyr Pro Gly Val Val Phe Asp lie Thr Lys Trp Leu 
885 890 895 

Leu Ala Val Leu Gly Pro Ala Tyr Leu Leu Lys Gly Ala Leu Thr Arg 
900 905 910 

Val Pro Tyr Phe Val Arg Ala His Ala Leu Leu Arg Met Cys Thr Met 
915 920 925 

Ala Arg His Leu Ala Gly Gly Arg Tyr Val Gin Met Ala Leu Leu Ala 
930 935 940 

Leu Gly Arg Trp Thr Gly Thr Tyr lie Tyr Asp His Leu Thr Pro Met 
945 950 955 960 

Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 

Pro lie lie Phe Ser Pro Met Glu Lys Lys Val lie Val Trp Gly Ala 
980 985 990 
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Glu Thr Ala Ala Cys Gly Asp lie Leu His Gly Leu Pro Val Ser Ala 
995 1000 1005 

Arg Leu Gly Arg Glu Val Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser 
1010 1015 1020 

Lys Gly Trp Ser Leu Leu Ala Pro lie Thr Ala Tyr Ala Gin Gin Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Thr lie Val Val Ser Met Thr Gly Arg Asp Lys 
1045 1050 1055 

Thr Glu Gin Ala Gly Glu He Gin Val Leu Set Thr Val Thr Gin Ser 
1060 1065 1070 

Phe Leu Gly Thr Ser He Ser Gly Val Leu Trp Thr Val Tyr His Gly 
1075 1080 1085 

Ala Gly Asn Lys Thr Leu Ala Gly Ser Arg Gly Pro Val Thr Gin Met 
1090 1095 1100 

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 1115 1120 

Thr Lys Ser Leu Glu Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg Asn Ala Asp Val He Pro Ala Arg Arg Arg Gly Asp Lys 
1140 1145 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser 
1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Ala Val Gly Val Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser He Asp Phe He 
1185 1190 1195 1200 

Pro Val Glu Thr Leu Asp He Val Thr Arg Ser Pro Thr Phe Ser Asp 
1205 1210 1215 

Asn Ser Thr Pro Pro Ala Val Pro Gin Thr Tyr Gin Val Gly Tyr Leu 
1220 1225 1230^ 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 
1235 1240 1245 
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Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly He Asn Pro 
1265 1270 1275 1280 

Asn lie Arg Thr Gly Val Arg Thr Val Thr Thr Gly Ala Pro He Thr 
1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly 
1300 1305 1310 

Ala Tyr Asp He He He Cys Asp Glu Cys Hi sT Ala Val Asp Ser Thr 
1315 1320 1325 

Thr He Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 
1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Thr Pro His Pro Asn He Glu Glu Val Ala Leu Gly Gin Glu Gly Glu 
1365 1370 1375 

He Pro Phe Tyr Gly Arg Ala He Pro Leu Ser Tyr He Lys Gly Gly 
1380 1385 1390 

Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 
1395 1400 1405 

Ala Ala Leu Arg Gly Met Gly Leu Asn Ser Val Ala Tyr Tyr Arg Gly 
1410 1415 1420 

Leu Asp Val Ser Val He Pro Thr Gin Gly Asp Val Val Val Val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He 
1445 1450 1455 

Asp Cys Asn Val Ala Val Thr Gin Val Val Asp Phe Ser Leu Asp Pro 
1460 1465 1470 

Thr Phe Thr He Thr Thr Gin He Val Pro Gin Asp Ala Val Ser Arg 
1475 1480 1485 

Ser Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu Gly He Tyr Arg 
1490 1495 1500 
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Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Glu Leu Thr Pro 
1525 1530 1535 

Ser Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 
1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 
1555 1560 1565 

Leu Thr His lie Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly 
1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Thr Ala Tyr Gin Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr 
1605 1610 1615 

Arg Leu Lys Pro Thr Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
1620 1625 1630 

Gly Ser Val Thr Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr 
1635 1640 1645 

lie Ala Thr Cys Met Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp 
1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Cys lie He Gly Arg Leu His He Asn Gin Arg Ala 
1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 
1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu He Glu Glu Gly Gin Arg He 
1715 1720 1725 

Ala Glu Met Leu Lys Ser Lys He Gin Gly Leu Leu Gin Gin Ala Ser 
1730 1735 1740 

Lys Gin Ala Gin Asp He Gin Pro Thr Val Gin Ala Ser Trp Pro Lys 
1745 1750 1755 1760 



11 



WO 00/75338 PCT/US00/15446 

Val Glu Gin Phe Trp Ala Lys His Met Trp Asn Phe He Ser Gly He 
1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 
1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 
1795 1800 1805 

Thr Thr He Leu Leu Asn He Leu Gly Gly Trp Leu Ala Ser Gin He 
1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser He Gly Leu Gly Lys Val Leu Val Asp He Leu 
1845 1850 1855 

Ala Gly Tyr Gly Ala Gly He Ser Gly Ala Leu Val Ala Phe Lys He 
I860 1865 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val Val Asn Leu Leu Pro 
1875 1880 1885 

Gly He Leu Ser Pro Gly Ala Leu Val Val Gly Val He Cys Ala Ala 
1890 1895 1900 

He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 1910 1915 1920 

Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 
1925 1930 1935 

His Tyr Val Thr Glu Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu 
1940 1945 1950 

Gly Ser Leu Thr He Thr Ser Leu Leu Arg Arg Leu His Asn Trp He 
1955 1960 1965 

Thr Glu Asp Cys Pro He Pro Cys Gly Gly Ser Trp Leu Arg Asp Val 
1970 1975 1980 

Trp Asp Trp Val Cys Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 

Ser Lys Leu Phe Pro Lys Met Pro Gly Leu Pro Phe Val Ser Cys Gin 
2005 2010 2015 
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Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly lie Met Thr Thr Arg 
2020 2025 2030 

Cys Pro Cys Gly Ala Asn lie Ser Gly Asn Val Arg Leu Gly Ser Met 
2035 2040 2045 

Arg lie Thr Gly Pro Lys Thr Cys Met Asn lie Trp Gin Gly Thr Phe 
.2050 2055 2060 

Pro He Asn Cys Tyr Thr Glu Gly Gin Cys Val Pro Lys Pro Ala Pro 
2065 2070 2075 2080 

Asn Phe Lys Val Ala He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu 
2085 2090 2095 

Val Thr Gin His Gly Ser Tyr His Tyr He Thr Gly Leu Thr Thr Asp 
2100 2105 2110 

Asn Leu Lys Val Pro Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp 
2115 2120 2125 

Val Asp Gly Val Gin He His Arg Phe Ala Pro Thr Pro Lys Pro Phe 
2130 2135 2140 

Phe Arg Asp Glu Val Ser Phe Cys Val Gly Leu Asn Ser Phe Val Val 
2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Asp Pro Glu Pro Asp Thr Asp Val Leu Met 
2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His He Thr Ala Glu Thr Ala Ala Arg 
2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser 
2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Gly Lys 
2210 2215 2220 

Ala Tyr Asp Val Asp Met Val Asp Ala Asn Leu Phe Met Gly Gly Asp 
2225 2230 2235 2240 

Val Thr Arg He Glu Ser Gly Ser Lys Val Val Val Leu Asp Ser Leu 
2245 2250 2255 

Asp Pro Met Val Glu Glu Arg Ser Asp Leu Glu Pro Ser He Pro Ser 
2260 2265 2270 
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Glu Tyr Met Leu Pro Lys Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp 
2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro 
2290 2295 2300 

Asp Tyr Gin Pro Ala Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Arg 
2305 2310 2315 2320 

Lys Thr Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 
2325 2330 2335 

Glu Asp Ser lie Gly Asp Ala Leu Gin Gin LeuT Ala lie Lys Ser Phe 
2340 * 2345 2350 

Gly Gin Pro Pro Pro Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Gly 
2355 2360 2365 

Ala Ala Asp Ser Gly Ser Gin Thr Pro Pro Asp Glu Leu Ala Leu Ser 
2370 2375 2380 

Glu Thr Gly Ser lie Ser Ser Met Pro Pro Leu Glu Gly Glu Leu Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Glu Pro Glu Gin Val Glu Pro Gin Pro Pro Pro Gin 
2405 2410 2415 

Gly Gly Val Ala Ala Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys 
2420 2425 2430 

Ser Glu Glu Asp Asp Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp 
2435 2440 2445 

Thr Gly Ala Leu lie Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 
2450 2455 2460 

lie Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr 
2465 2470 2475 2480 

Cys Thr Thr Thr Lys Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe 
2485 2490 2495 

Asp Arg Met Gin Val Leu Asp Ser Tyr Tyr Asp Ser Val Leu Lys Asp 
2500 2505 2510 

lie Lys Leu Ala Ala Ser Lys Val Thr Ala Arg Leu Leu Thr Met Glu 
2515 2520 2525 
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Glu Ala Cys Gin Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 
2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

lie Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Glu Thr Pro lie 
2565 2570 2575 

Pro Thr Thr lie Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Thr 
2580 2585 2590 

Lys Gly Gly Lys Lys Ala Ala Arg Leu He Val Tyr Pro Asp Leu Gly 
2595 2600 2605 

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp He Thr Gin Lys Leu 
2610 2615 2620 

Pro Gin Ala Val Met Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala 
2625 2630 2635 2640 

Gin Arg Val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro 
2645 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 
2660 2665 2670 

Arg Asp He Arg Thr Glu Glu Ser He Tyr Arg Ala Cys Ser Leu Pro 
2675 2680 2685 

Glu Glu Ala His Thr Ala He His Ser Leu Thr Glu Arg Leu Tyr Val 
2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gin Thr Cys Gly Tyr Arg Arg 
2705 2710 2715 2720 

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr He Thr 
2725 2730 2735 

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly He He Ala 
2740 2745 2750 

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val He Ser Glu Ser 
2755 2760 2765 

Gin Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 
2770 2775 2780 
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Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 
2785 2790 2795 2800 

Asp Leu Glu Leu lie Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu 
2805 2810 2815 

Gly Pro Gin Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 
2820 2825 2830 

Pro lie Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Val Asn 
2835 2840 2845 

Ser Trp Leu Gly Asn lie lie Gin Tyr Ala Pro Thr lie Trp Ala Arg 
2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser lie Leu Met Ala Gin Asp Thr 
2865 2870 2875 2880 

Leu Asp Gin Asn Leu Asn Phe Glu Met Tyr Gly Ala Val Tyr Ser Val 
2885 2890 2895 

Ser Pro Leu Asp Leu Pro Ala lie lie Glu Arg Leu His Gly Leu Asp 
2900 2905 2910 

Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala 
2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser 
2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu lie Ser Arg Gly Gly Arg Ala 
2945 2950 2955 2960 

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 
2965 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 
2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp lie Tyr His Ser Val Ser Arg 
2995 3000 3005 

Ala Arg Pro Arg Leu Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly 
3010 3015 " 3020 

Val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 
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<210> 3 
<211> 9611 
<212> DNA 

<213> Hepatitis C virus 
<400> 3 

gccagccccc tgatgggggc gacactccac catgaatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gataaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcaagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 3 00 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcaca aatcctaaac 360 
ctcaaagaaa aaccaaaaga aacaccaacc gtcgcccaca agacgttaag tttccgggcg 420 
gcggccagat cgttggcgga gtatacttgt tgccgcgcag gggccccagg ttgggtgtgc 480 
gcgcgacaag gaagacttcg gagcggtccc agccacgtgg aaggcgccag cccatcccta 54 0 
aagatcggcg ctccactggc aaatcctggg gaaaaccagg atacccctgg cccctatacg 600 
ggaatgaggg actcggctgg gcaggatggc tcctgtcccc ccgaggttcc cgtccctctt 660 
ggggccccaa tgacccccgg cataggtcgc gcaacgtggg taaggtcatc gataccctaa 720 
cgtgcggctt tgccgacctc atggggtaca tccctgtcgt gggcgccccg ctcggcggcg 780 
tcgccagagc tctcgcgcat ggcgtgagag tcctggagga cggggttaat tttgcaacag 84 0 
ggaacttacc cggttgctcc ttttctatct tcttgctggc cctgctgtcc tgcatcacca 900 
ccccggtctc cgctgccgaa gtgaagaaca tcagtaccgg ctacatggtg actaacgact 960 
gcaccaatga cagcattacc tggcagctcc aggctgctgt cctccacgtc cccgggtgcg 1020 
tcccgtgcga gaaagtgggg aatgcatctc agtgctggat accggtctca ccgaatgtgg 1080 
ccgtgcagcg gcccggcgcc ctcacgcagg gcttgcggac gcacatcgac atggttgtga 1140 
tgtccgccac gctctgctct gccctctacg tgggggacct ctgcggtggg gtgatgctcg 1200 
cagcccaaat gttcattgtc tcgccgcagc accactggtt tgtccaagac tgcaattgct 1260 
ccatctaccc tggtaccatc actggacacc gcatggcatg ggacatgatg atgaactggt 1320 
cgcccacggc taccatgatc ttggcgtacg cgatgcgtgt ccccgaggtc attatagaca 1380 
tcattagcgg ggctcattgg ggcgtcatgt tcggcttggc ctacttctct atgcagggag 144 0 
cgtgggcgaa agtcgttgtc atccttctgt tggccgccgg ggtggacgcg cgcacccata 1500 
ctgttggggg ttctgccgcg cagaccaccg ggcgcctcac cagcttattt gacatgggcc 1560 
ccaggcagaa aatccagctc gttaacacca atggcagctg gcacatcaac cgcaccgccc 1620 
tgaactgcaa tgactccttg cacaccggct ttatcgcgtc tctgttctac acccacagct 1680 
tcaactcgtc aggatgtccc gaacgcatgt ccgcctgccg cagtatcgag gccttccggg 174 0 
tgggatgggg cgccttgcaa tatgaggata atgtcaccaa tccagaggat atgagaccct 1800 
attgctggca ctacccacca aggcagtgtg gcgtggtctc cgcgaagact gtgtgtggcc 1860 
cagtgtactg tttcaccccc agcccagtgg tagtgggcac gaccgacagg cttggagcgc 1920 
ccacttacac gtggggggag aatgagacag atgtcttcct attgaacagc actcgaccac 1980 
cgctggggtc atggttcggc tgcacgtgga tgaactcttc tggctacacc aagacttgcg 2 04 0 
gcgcaccacc ctgccgtact agagctgact tcaacgccag cacggacctg ttgtgcccca 2100 
cggactgttt taggaagcat cctgatacca cttacctcaa atgcggctct gggccctggc 2160 
tcacgccaag gtgcctgatc gactacccct acaggctctg gcattacccc tgcacagtta 2220 
actataccat cttcaaaata aggatgtatg tgggaggggt tgagcacagg ctcacggctg 2280 
catgcaattt cactcgtggg gatcgttgca acttggagga cagagacaga agtcaactgt 2340 
ctcctttgtt gcactccacc acggaatggg ccattttacc ttgctcttac tcggacctgc 2400 
ccgccttgtc gactggtctt ctccacctcc accaaaacat cgtggacgta caattcatgt 2460 
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atggcctatc acctgccctc acaaaataca tcgtccgatg ggagtgggta atactcttat 2520 
tcctgctctt agcggacgcc agggtttgcg cctgcttatg gatgctcatc ttgttgggcc 2 580 
aggccgaagc agcactagag aagctggtca tcttgcacgc tgcgagcgca gctagctgca 264 0 
atggcttcct atattttgtc atctttttcg tggctgcttg gtacatcaag ggtcgggtag 2700 
tccccttagc tacctattcc ctcactggcc tgtggtcctt tagcctactg ctcctagcat 2760 
tgccccaaca ggcatatgca ctggacacgg aggtggccgc gtcgtgtggc ggcgttgttc 2820 
ttgtcgggtt aatggcgctg actctgtcgc catattacaa gcgctatatc agctggtgca 2880 
tgtggtggct tcagtatttt ctgaccagag tagaagcgca actgcacgtg tgggttcccc 2 940 
ccctcaacgt ccgggggggg cgcgatgccg teat ct tact catgtgtgta gtacacccga 3000 
ccctggtatt tgacatcacc aaactactcc tggecatett cggacccctt tggattcttc 3060 
aagccagttt gcttaaagtc ccctacttcg tgcgcgttca aggccttctc eggatctgeg 3120 
cgctagcgcg gaagatagee ggaggtcatt acgtgcaaat ggccatcatc aagttagggg 3180 
cgcttactgg cacctatgtg tataaccatc tcacccctct tcgagactgg gcgcacaacg 3240 
gectgegaga tctggccgtg gctgtggaac cagtegtett ctcccgaatg gagaccaagc 3300 
teatcaegtg gggggcagat accgccgcgt gcggtgacat catcaacggc ttgcccgtct 3 360 
ctgcccgtag gggecaggag atactgettg ggccagccga cggaatggtc tccaaggggt 3420 
ggaggttgct ggcgcccatc aeggegtacg cccagcagac gagaggcetc ctagggtgta 3480 
taatcaccag cctgactggc egggacaaaa accaagtgga gggtgaggtc cagatcgtgt 3540 
caactgctac ccaaaccttc ctggcaacgt gcatcaatgg ggtatgctgg actgtctacc 3600 
aeggggcegg aacgaggacc atcgcatcac ccaagggtcc tgtcatccag atgtatacca 3660 
atgtggacca agaccttgtg ggctggcccg ctcctcaagg ttcccgctca ttgacaccct 3720 
gtacctgegg ctcctcggac ctttacctgg tcacgaggca cgccgatgtc attcccgtgc 3780 
geeggegagg tgatagcagg ggtagcctgc tttcgccccg gcccatttcc tacttgaaag 3 840 
gctcctcggg gggtccgctg ttgtgccccg cgggacacgc cgtgggccta ttcagggccg 3900 
cggtgtgcac ccgtggagtg getaaagegg tggactttat ccctgtggag aacctaggga 3 960 
caaccatgag atccccggtg ttcaeggaca actcctctcc accagcagtg ccccagagct 4020 
tccaggtggc ccacctgcat gctcccaccg geageggtaa gagcaccaag gtcccggctg 4 080 
cgtacgcagc ccagggctac aaggtgttgg tgctcaaccc ctctgttgct geaaegctgg 4140 
gctttggtgc ttacatgtcc aaggcccatg gggttgatcc taatatcagg accggggtga 42 00 
gaacaattac cactggcagc cccatcacgt actccaccta eggcaagtte cttgccgacg 4260 
gcgggtgctc aggaggtget tatgacataa taatttgtga cgagtgccac tecaeggatg 4320 
ccacatccat cttgggcatc ggcactgtcc ttgaccaagc agagactgeg ggggegagae 4380 
tggttgtgct cgccactgct acccctccgg gctccgtcac tgtgtcccat cctaacatcg 4440 
aggaggttgc tctgtccacc aceggagaga tcccctttta eggcaagget atccccctcg 4500 
aggtgatcaa ggggggaaga catctcatct tctgccactc aaagaagaag tgegacgage 4560 
tcgccgcgaa getggtcgea ttgggcatca ^.tgccgtggc ctactaccgc ggtcttgacg 4620 
tgtctgtcat cccgaccagc ggcgatgttg tcgtcgtgtc gaecgatget ctcatgactg 4680 
getttacegg cgacttcgac tctgtgatag actgcaacac gtgtgtcact cagacagtcg 4740 
atttcagect tgaccctacc tttaccattg agacaaccac gctcccccag gatgetgtet 4 800 
ccaggactca acgccggggc aggactggca gggggaagee aggcatctat agatttgtgg 4860 
caceggggga gcgcccctcc ggcatgttcg actcgtccgt cctctgtgag tgctatgacg 4920 
cgggctgtgc ttggtatgag ctcacgcccg ccgagactac agttaggcta egagegtaca 4 980 
tgaacacccc ggggcttccc gtgtgccagg accatcttga attttgggag ggegtcttta 5040 
cgggcctcac tcatatagat gcccactttt tatcccagac aaagcagagt ggggagaact 5100 
ttccttacct ggtagegtae caagccaccg tgtgcgctag ggctcaagcc cctcccccat 5160 
cgtgggacca gatgtggaag tgtttgatcc gccttaaacc caccctccat gggccaacac 5220 
ccctgctata cagactgggc gctgttcaga atgaagtcac cctgacgcac ccaatcacca 52 80 
aatacatcat gaeatgeatg tcggccgacc tggaggtcgt cacgagcacc tgggtgctcg 5340 
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ttggcggcgt cctggctgct ctggccgcgt attgcctgtc aacaggctgc gtggtcatag 5400 
tgggcaggat cgtcttgtcc gggaagccgg caattatacc tgacagggag gttctctacc 5460 
aggagttcga tgagatggaa gagtgctctc agcacttacc gtacatcgag caagggatga 5520 
tgctcgctga gcagttcaag cagaaggccc tcggcctcct gcagaccgcg tcccgccatg 5580 
cagaggttat cacccctgct gtccagacca actggcagaa actcgaggtc ttttgggcga 5640 
agcacatgtg gaatttcatc agtgggatac aatacttggc gggcctgtca acgctgcctg 5700 
gtaaccccgc cattgcttca ttgatggctt ttacagctgc cgtcaccagc ccactaacca 5760 
ctggccaaac cctcctcttc aacatattgg gggggtgggt ggctgcccag ctcgccgccc 5820 
ccggtgccgc tactgccttt gtgggtgctg gcctagctgg cgccgccatc ggcagcgttg 5880 
gactggggaa ggtcctcgtg gacattcttg cagggtatgg cgcgggcgtg gcgggagctc 594 0 
ttgtagcatt caagatcatg agcggtgagg tcccctccac ggaggacctg gtcaatctgc 6000 
tgcccgccat cctctcgcct ggagcccttg tagtcggtgt ggtctgcgca gcaatactgc 6060 
gccggcacgt tggcccgggc gagggggcag tgcaatggat gaaccggcta atagccttcg 6120 
cctcccgggg gaaccatgtt tcccccacgc actacgtgcc ggagagcgat gcagccgccc 6180 
gcgtcactgc catactcagc agcctcactg taacccagct cctgaggcga ctgcatcagt 6240 
ggataagctc ggagtgtacc actccatgct ccggttcctg gctaagggac atctgggact 63 00 
ggatatgcga ggtgctgagc gactttaaga cctggctgaa agccaagctc atgccacaac 63 60 
tgcctgggat tccctttgtg tcctgccagc gcgggtatag gggggtctgg cgaggagacg 6420 
gcattatgca cactcgctgc cactgtggag ctgagatcac tggacatgtc aaaaacggga 64 80 
cgatgaggat cgtcggtcct aggacctgca ggaacatgtg gagtgggacg ttccccatta 654 0 
acgcctacac cacgggcccc tgtactcccc ttcctgcgcc gaactataag ttcgcgctgt 6600 
ggagggtgtc tgcagaggaa tacgtggaga taaggcgggt gggggacttc cactacgtat 6660 
cgggtatgac tactgacaat cttaaatgcc cgtgccagat cccatcgccc gaatttttca 6720 
cagaattgga cggggtgcgc ctacacaggt ttgcgccccc ttgcaagccc ttgctgcggg 6780 
aggaggtatc attcagagta ggactccacg agtacccggt ggggtcgcaa ttaccttgcg 684 0 
agcccgaacc ggacgtagcc gtgttgacgt ccatgctcac tgatccctcc catataacag 6900 
cagaggcggc cgggagaagg ttggcgagag ggtcaccccc ttctatggcc agctcctcgg 6960 
ctagccagct gtccgctcca tctctcaagg caacttgcac cgccaaccat gactcccctg 7020 
acgccgagct catagaggct aacctcctgt ggaggcagga gatgggcggc aacatcacca 7080 
gggttgagtc agagaacaaa gtggtgattc tggactcctt cgatccgctt gtggcagagg 7140 
aggatgagcg ggaggtctcc gtacctgcag aaattctgcg gaagtctcgg agattcgccc 7200 
gggccctgcc cgtctgggcg cggccggact acaacccccc gctagtagag acgtggaaaa 7260 
agcctgacta cgaaccacct gtggtccatg gctgcccgct accacctcca cggtcccctc 7 320 
ctgtgcctcc gcctcggaaa aagcgtacgg tggtcctcac cgaatcaacc ctatctactg 7380 
ccttggccga gcttgccacc aaaagttttg gcagctcctc aacttccggc attacgggcg 744 0 
acaatacgac aacatcctct gagcccgccc cttctggctg cccccccgac tccgacgttg 7500 
agtcctattc ttccatgccc cccctggagg gggagcctgg ggatccggat ctcagcgacg 7560 
ggtcatggtc gacggtcagt agtggggccg acacggaaga tgtcgtgtgc tgctcaatgt 7620 
cttattcctg gacaggcgca ctcgtcaccc cgtgcgctgc ggaagaacaa aaactgccca 7680 
tcaacgcact gagcaactcg ttgctacgcc atcacaatct ggtgtattcc accacttcac 774 0 
gcagtgcttg ccaaaggcag aagaaagtca catttgacag actgcaagtt ctggacagcc 7800 
attaccagga cgtgctcaag gaggtcaaag cagcggcgtc aaaagtgaag gctaacttgc 7 860 
tatccgtaga ggaagcttgc agcctgacgc ccccacattc agccaaatcc aagtttggct 7 92 0 
atggggcaaa agacgtccgt tgccatgcca gaaaggccgt agcccacatc aactccgtgt 7980 
ggaaagacct tctggaagac agtgtaacac caatagacac taccatcatg gccaagaacg 804 0 
aggttttctg cgttcagcct gagaaggggg gtcgtaagcc agctcgtctc atcgtgttcc 8100 
ccgacctggg cgtgcgcgtg tgcgagaaga tggccctgta cgacgtggtt agcaagctcc 8160 
ccctggccgt gatgggaagc tcctacggat tccaatactc accaggacag cgggttgaat 8220 
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tcctcgtgca agcgtggaag tccaagaaga ccccgatggg gttctcgtat gatacccgct 8280 

gttttgactc cacagtcact gagagcgaca tccgtacgga ggaggcaatt taccaatgtt 8340 

gtgacctgga cccccaagcc cgcgtggcca tcaagtccct cactgagagg ctttatgttg 84 00 

ggggccctct taccaattca aggggggaaa actgcggcta ccgcaggtgc cgcgcgagcg 8460 

gcgtactgac aactagctgt ggtaacaccc tcacttgcta catcaaggcc cgggcagcct 8520 

gtcgagccgc agggctccag gactgcacca tgctcgtgtg tggcgacgac ttagtcgtta 8580 

tctgtgaaag tgcgggggtc caggaggacg cggcgagcct gagagccttc acggaggcta 8640 

tgaccaggta ctccgccccc cccggggacc ccccacaacc agaatacgac ttggagctta 8700 

taacatcatg ctcctccaac gtgtcagtcg cccacgacgg cgctggaaag agggtctact 8760 

accttacccg tgaccctaca acccccctcg cgagagccgc gtgggagaca gcaagacaca 8820 

ctccagtcaa ttcctggcta ggcaacataa tcatgtttgc ccccacactg tgggcgagga 8880 

tgatactgat gacccatttc tttagcgtcc tcatagccag ggatcagctt gaacaggctc 8940 

ttaactgtga gatctacgga gcctgctact ccatagaacc actggatcta cctccaatca 9000 

ttcaaagact ccatggcctc agcgcatttt cactccacag ttactctcca ggtgaaatca 9060 

atagggtggc cgcatgcctc agaaaacttg gggtcccgcc cttgcgagct tggagacacc 9120 

gggcccggag cgtccgcgct aggcttctgt ccagaggagg cagggctgct atatgtggca 9180 

agtacctctt caactgggca gtaagaacaa agctcaaact cactccaata gcggccgctg 9240 

gccggctgga cttgtccggt tggttcacgg ctggctacag cgggggagac atttatcaca 9300 

gcgtgtctca tgcccggccc cgctggttct ggttttgcct actcctgctc gctgcagggg 9360 

taggcatcta cctcctcccc aaccgatgaa ggttggggta aacactccgg cctcttaagc 9420 

catttcctgt tttttttttt tttttttttt tttttttctt tttttttttc tttcctttcc 9480 

ttcttttttt cctttctttt tcccttcttt aatggtggct ccatcttagc cctagtcacg 9540 

gctagctgtg aaaggtccgt gagccgcatg actgcagaga gtgctgatac tggcctctct 9600 

gcagatcatg t 9611 

<210> 4 
<211> 3015 
<212> PRT 

<213> Hepatitis C virus 
<400> 4 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
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Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 



Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 



lie Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 



Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
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85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val lie Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Val Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys lie Thr Thr Pro Val Ser Ala Ala 
180 185 190 

Glu Val Lys Asn lie Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr 
195 200 205 

Asn Asp Ser lie Thr Trp Gin Leu Gin Ala Ala Val Leu His Val Pro 
210 215 220 

Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gin Cys Trp lie 
225 230 235 240 

Pro Val Ser Pro Asn Val Ala Val Gin Arg Pro Gly Ala Leu Thr Gin 
245 250 255 

Gly Leu Arg Thr His lie Asp Met Val Val Met Ser Ala Thr Leu Cys 
260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 
275 280 " 285 

Gin Met Phe lie Val Ser Pro Gin His His Trp Phe Val Gin Asp Cys 
290 295 300 

Asn Cys Ser lie Tyr Pro Gly Thr lie Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met lie Leu Ala Tyr 
325 330 335 

Ala Met Arg Val Pro Glu Val lie lie Asp lie lie Ser Gly Ala His 
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340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp 
355 360 365 

Ala Lys Val Val Val He Leu Leu Leu Ala Ala Gly Val Asp Ala Arg 
370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gin Thr Thr Gly Arg Leu Thr 
385 390 395 400 

Ser Leu Phe Asp Met Gly Pro Arg Gin Lys He Gin Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu His Thr Gly Phe He Ala Ser Leu Phe Tyr Thr His Ser Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser He Glu Ala 
450 455 460 

Phe Arg Val Gly Trp Gly Ala Leu Gin Tyr Glu Asp Asn Val Thr Asn 
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gin Cys 
485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 
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595 600 605 

Pro Arg Cys Leu lie Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
610 615 620 

Thr Val Asn Tyr Thr lie Phe Lys lie Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 
645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gin Leu Ser Pro Leu Leu His Ser 
660 665 670 

Thr Thr Glu Trp Ala lie Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala 
675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gin Asn lie Val Asp Val Gin 
690 695 700 

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr lie Val Arg Trp 
705 710 715 720 

Glu Trp Val lie Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
725 730 735 

Ala Cys Leu Trp Met Leu lie Leu Leu Gly Gin Ala Glu Ala Ala Leu 
740 745 750 

Glu Lys Leu Val lie Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly 
755 760 765 

Phe Leu Tyr Phe Val lie Phe Phe Val Ala Ala Trp Tyr lie Lys Gly 
770 775 780 

Arg Val Val Pro Leu Ala Thr Tyr Ser Leu Thr Gly Leu Trp Ser Phe 
785 790 795 800 

Ser Leu Leu Leu Leu Ala Leu Pro Gin Gin Ala Tyr Ala Leu Asp Thr 
805 810 815 

Glu Val Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly Leu Met Ala 
820 825 830 

Leu Thr Leu Ser Pro Tyr Tyr Lys Arg Tyr lie Ser Trp Cys Met Trp 
835 840 845 

Trp Leu Gin Tyr Phe Leu Thr Arg Val Glu Ala Gin Leu His Val Trp 
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850 855 860 

Val Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Val lie Leu Leu 
865 870 875 880 

Met Cys Val Val His Pro Thr Leu Val Phe Asp lie Thr Lys Leu Leu 
885 890 895 

Leu Ala lie Phe Gly Pro Leu Trp He Leu Gin Ala Ser Leu Leu Lys 
900 905 910 

Val Pro Tyr Phe Val Arg Val Gin Gly Leu Leu Arg lie Cys Ala Leu 
915 920 925 

Ala Arg Lys He Ala Gly Gly His Tyr Val Gin Met Ala He He Lys 
930 935 940 

Leu Gly Ala Leu Thr Gly Thr Tyr Val Tyr Asn His Leu Thr Pro Leu 
945 950 955 960 

Arg Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 

Pro Val Val Phe Ser Arg Met Glu Thr Lys Leu He Thr Trp Gly Ala 
980 985 990 

Asp Thr Ala Ala Cys Gly Asp He He Asn Gly Leu Pro Val Ser Ala 
995 1000 1005 

Arg Arg Gly Gin Glu He Leu Leu Gly Pro Ala Asp Gly Met Val Ser 
1010 1015 1020 

Lys Gly Trp Arg Leu Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Cys He He Thr Ser Leu Thr Gly Arg Asp Lys 
1045 1050 1055 

Asn Gin Val Glu Gly Glu Val Gin He Val Ser Thr Ala Thr Gin Thr 
1060 1065 1070 

Phe Leu Ala Thr Cys He Asn Gly Val Cys Trp Thr Val Tyr His Gly 
1075 1080 1085 

Ala Gly Thr Arg Thr He Ala Ser Pro Lys Gly Pro Val He Gin Met 
1090 1095 1100 

Tyr Thr Asn Val Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly 
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1105 1110 1115 1120 

Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg His Ala Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser 
1140 1145 1150 

Arg Gly Ser Leu Leu Ser Pro Arg Pro He Ser Tyr Leu Lys Gly Ser 
1155 1160 1165 

Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe He 
1185 1190 1195 1200 

Pro Val Glu Asn Leu Gly Thr Thr Met Arg Ser Pro Val Phe Thr Asp 
1205 1210 1215 

Asn Ser Ser Pro Pro Ala Val Pro Gin Ser Phe Gin Val Ala His Leu 
1220 '1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr 
1235 1240 1245 

Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp Pro 
1265 1270 1275 1280 

Asn He Arg Thr Gly Val Arg Thr He Thr Thr Gly Ser Pro He Thr 
1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly 
1300 1305 1310 

Ala Tyr Asp He He He Cys Asp Glu Cys His Ser Thr Asp Ala Thr 
1315 1320 1325 

Ser He Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 
1330 1335 1340 

Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Val Ser His Pro Asn He Glu Glu Val Ala Leu Ser Thr Thr Gly Glu 
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1365 1370 1375 

lie Pro Phe Tyr Gly Lys Ala He Pro Leu Glu Val He Lys Gly Gly 
1380 1385 1390 

Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 
1395 1400 1405 

Ala Lys Leu Val Ala Leu Gly He Asn Ala Val Ala Tyr Tyr Arg Gly 
1410 1415 1420 

Leu Asp Val Ser Val He Pro Thr Ser Gly Asp Val Val Val Val Ser 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val He 
1445 1450 1455 

Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro 
1460 1465 1470 

Thr Phe Thr He Glu Thr Thr Thr Leu Pro Gin Asp Ala Val Ser Arg 
1475 1480 1485 

Thr Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro Gly He Tyr Arg 
1490 1495 1500 

Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro 
1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn Thr Pro Gly Leu 
1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr Gly 
1555 1560 1565 

Leu Thr His He Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly 
1570 1575 1580 

Glu Asn Phe Pro Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Gin Ala Pro Pro Pro Ser Trp Asp Gin Met Trp Lys Cys Leu He 
1605 1610 1615 

Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
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1620 1625 1630 

Gly Ala Val Gin Asn Glu Val Thr Leu Thr His Pro lie Thr Lys Tyr 
1635 1640 1645 

lie Met Thr Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp 
1650 1655 1660 

Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser 
1665 1670 1675 1680 

Thr Gly Cys Val Val He Val Gly Arg He Val Leu Ser Gly Lys Pro 
1685 1690 1695 

Ala He He Pro Asp Arg Glu Val Leu Tyr Gin Glu Phe Asp Glu Met 
1700 1705 1710 

Glu Glu Cys Ser Gin His Leu Pro Tyr He Glu Gin Gly Met Met Leu 
1715 1720 1725 

Ala Glu Gin Phe Lys Gin Lys Ala Leu Gly Leu Leu Gin Thr Ala Ser 
1730 1735 1740 

Arg His Ala Glu Val He Thr Pro Ala Val Gin Thr Asn Trp Gin Lys 
1745 1750 1755 1760 

Leu Glu Val Phe Trp Ala Lys His Met Trp Asn Phe He Ser Gly He 
1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala He Ala 
1780 1785 1790 

Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Gly 
1795 1800 1805 

Gin Thr Leu Leu Phe Asn He Leu Gly Gly Trp Val Ala Ala Gin Leu 
1810 1815 1820 

Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala Gly Leu Ala Gly 
1825 1830 1835 1840 

Ala Ala He Gly Ser Val Gly Leu Gly Lys Val Leu Val Asp He Leu 
1845 1850 1855 

Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys lie 
1860 1865 1870 

Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro 
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1875 1880 1885 

Ala lie Leu Ser Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala 
1890 1895 1900 

lie Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 1910 1915 1920 

Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr 
1925 1930 1935 

His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val Thr Ala He Leu 
1940 1945 1950 

Ser Ser Leu Thr Val Thr Gin Leu Leu Arg Arg Leu His Gin Trp lie 
1955 1960 1965 

Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp He 
1970 1975 1980 

Trp Asp Trp He Cys Glu Val Leu Ser Asp Phe Lys Thr Trp Leu Lys 
1985 1990 1995 2000 

Ala Lys Leu Met Pro Gin Leu Pro Gly He Pro Phe Val Ser Cys Gin 
2005 2010 2015 

Arg Gly Tyr Arg Gly Val Trp Arg Gly Asp Gly He Met His Thr Arg 
2020 2025 2030 

Cys His Cys Gly Ala Glu He Thr Gly His Val Lys Asn Gly Thr Met 
2035 2040 2045 

Arg He Val Gly Pro Arg Thr Cys Arg Asn Met Trp Ser Gly Thr Phe 
2050 2055 2060 

Pro He Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Leu Pro Ala Pro 
2065 2070 2075 2080 

Asn Tyr Lys Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr Val Glu 
2085 2090 2095 

He Arg Arg Val Gly Asp Phe His Tyr Val Ser Gly Met Thr Thr Asp 
2100 2105 2110 

Asn Leu Lys Cys Pro Cys Gin He Pro Ser Pro Glu Phe Phe Thr Glu 
2115 2120 2125 

Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro Cys Lys Pro Leu 
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2130 2135 2140 

Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val 

2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr 

2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His lie Thr Ala Glu Ala Ala Gly Arg 

2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Met Ala Ser Ser Ser Ala Ser 

2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr Ala Asn His Asp 

2210 2215 2220 

Ser Pro Asp Ala Glu Leu lie Glu Ala Asn Leu Leu Trp Arg Gin Glu 

2225 2230 2235 2240 

Met Gly Gly Asn lie Thr Arg Val Glu Ser Glu Asn Lys Val Val lie 

2245 2250 2255 

Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp Glu Arg Glu Val 

2260 2265 2270 

Ser Val Pro Ala Glu lie Leu Arg Lys Ser Arg Arg Phe Ala Arg Ala 

2275 2280 2285 

Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Thr 

2290 2295 2300 

Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His Gly Cys Pro Leu 

2305 2310 2315 2320 

Pro Pro Pro Arg Ser Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr 

2325 2330 2335 

Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu Ala Glu Leu Ala 

2340 2345 2350 

Thr Lys Ser Phe Gly Ser Ser Ser Thr Ser Gly lie Thr Gly Asp Asn 

2355 2360 2365 

Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys Pro Pro Asp Ser 

2370 2375 2380 

Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
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2385 2390 2395 2400 

Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser Ser Gly Ala 
2405 2410 2415 

Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly 
2420 2425 2430 

Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gin Lys Leu Pro lie Asn 
2435 2440 2445 

Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr 
2450 2455 2460 

Thr Ser Arg Ser Ala Cys Gin Arg Gin Lys Lys Val Thr Phe Asp Arg 
2465 2470 2475 2480 

Leu Gin Val Leu Asp Ser His Tyr Gin Asp Val Leu Lys Glu Val Lys 
2485 2490 2495 

Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala 
2500 2505 2510 

Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly 
2515 2520 2525 

Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val Ala His lie Asn 
2530 2535 2540 

Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Val Thr Pro lie Asp Thr 
2545 2550 2555 2560 

Thr lie Met Ala Lys Asn Glu Val Phe Cys v ^l Gin Pro Glu Lys Gly 
2565 2570 2575 

Gly Arg Lys Pro Ala Arg Leu lie Val Phe Pro Asp Leu Gly Val Arg 
2580 2585 2590 

Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Ser Lys Leu Pro Leu 
2595 2600 2605 

Ala Val Met Gly Ser Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg 
2610 2615 2620 

Val Glu Phe Leu Val Gin Ala Trp Lys Ser Lys Lys Thr Pro Met Gly 
2625 2630 2635 2640 

Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp 
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2645 2650 2655 

lie Arg Thr Glu Glu Ala lie Tyr Gin Cys Cys Asp Leu Asp Pro Gin 
2660 2665 2670 

Ala Arg Val Ala lie Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly 
2675 2680 2685 

Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg 
2690 2695 2700 

Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr 
2705 2710 2715 2720 

lie Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gin Asp Cys Thr 
2725 2730 2735 

Met Leu Val Cys Gly Asp Asp Leu Val Val lie Cys Glu Ser Ala Gly 
2740 2745 2750 

Val Gin Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr 
2755 2760 2765 

Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr Asp Leu 
2770 2775 2780 

Glu Leu lie Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly 
2785 2790 2795 2800 

Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu 
2805 2810 2815 

Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp 
2820 2825 2830 

Leu Gly Asn lie lie Met Phe Ala Pro Thr Leu Trp Ala Arg Met lie 
2835 2840 ^ 2845 

Leu Met Thr His Phe Phe Ser Val Leu lie Ala Arg Asp Gin Leu Glu 
2850 2855 2860 

Gin Ala Leu Asn Cys Glu lie Tyr Gly Ala Cys Tyr Ser lie Glu Pro 
2865 2870 2875 2880 

Leu Asp Leu Pro Pro lie lie Gin Arg Leu His Gly Leu Ser Ala Phe 
2885 2890 2895 

Ser Leu His Ser Tyr Ser Pro Gly Glu lie Asn Arg Val Ala Ala Cys 
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2900 2905 2910 

Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala 
2915 2920 2925 

Arg Ser Val Arg Ala Arg Leu Leu Ser Arg Gly Gly Arg Ala Ala lie 
2930 2935 2940 

Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr Lys Leu Lys Leu 
2945 2950 2955 2960 

Thr Pro lie Ala Ala Ala Gly Arg Leu Asp Leu Ser Gly Trp Phe Thr 
2965 2970 2975 

Ala Gly Tyr Ser Gly Gly Asp lie Tyr His Ser Val Ser His Ala Arg 
2980 2985 2990 

Pro Arg Trp Phe Trp Phe Cys Leu Leu Leu Leu Ala Ala Gly Val Gly 
2995 3000 3005 

lie Tyr Leu Leu Pro Asn Arg 
3010 3015 



<210> 5 
<211> 9611 
<212> DNA 

<213> Hepatitis C virus 
<400> 5 

gccagccccc tgatgggggc gacactccac 
tcttcacgca gaaagcgtct agccatggcg 
cccccctccc gggagagcca tagtggtctg 
gaagactggg tcctttcttg gataaaccca 
gcaagactgc tagccgagta gcgttgggtt 
gtgcttgcga gtgccccggg aggtctcgta 
ctcaaagaaa aaccaaaaga aacaccaacc 
gcggccagat cgttggcgga gtatacttgt 
gcgcgacaag gaagacttcg gagcggtccc 
aagatcggcg ctccactggc aaatcctggg 
ggaatgaggg actcggctgg gcaggatggc 
ggggccccaa tgacccccgg cataggtcgc 
cgtgcggctt tgccgacctc atggggtaca 
tcgccagagc tctcgcgcat ggcgtgagag 
ggaacttacc cggttgctcc ttttctatct 
ccccggtctc cgctgccgaa gtgaagaaca 
gcaccaatga cagcattacc tggcagctcc 
tcccgtgcga gaaagtgggg aatgcatctc 



catgaatcac tcccctgtga ggaactactg 60 
ttagtatgag tgtcgtgcag cctccaggac 120 
cggaaccggt gagtacaccg gaattgccgg 180 
ctctatgccc ggccatttgg gcgtgccccc 240 
gcgaaaggcc ttgtggtact gcctgatagg 300 
gaccgtgcac catgagcaca aatcctaaac 360 
gtcgcccaca agacgttaag tttccgggcg 420 
tgccgcgcag gggccccagg ttgggtgtgc 4 80 
agccacgtgg aaggcgccag* cccatcccta 540 
gaaaaccagg atacccctgg cccctatacg 600 
tcctgtcccc ccgaggttcc cgtccctctt 660 
gcaacgtggg taaggtcatc gataccctaa 720 
tccctgtcgt gggcgccccg ctcggcggcg 780 
tcctggagga cggggttaat tttgcaacag 840 
tcttgctggc cctgctgtcc tgcatcacca 900 
tcagtaccgg ctacatggtg actaacgact 960 
aggctgctgt cctccacgtc cccgggtgcg 102 0 
agtgctggat accggtctca ccgaatgtgg 1080 
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ccgtgcagcg gcccggcgcc ctcacgcagg gcttgcggac gcacatcgac atggttgtga 1140 
tgtccgccac gctctgctct gccctctacg tgggggacct ctgcggtggg gtgatgctcg 1200 
cagcccaaat gttcattgtc tcgccgcagc accactggtt tgtccaagac tgcaattgct 1260 
ccatctaccc tggtaccatc actggacacc gcatggcatg ggacatgatg atgaactggt 1320 
cgcccacggc taccatgatc ttggcgtacg cgatgcgtgt ccccgaggtc attatagaca 1380 
tcattagcgg ggctcattgg ggcgtcatgt tcggcttggc ctacttctct atgcagggag 1440 
cgtgggcgaa agtcgttgtc atccttctgt tggccgccgg ggtggacgcg cgcacccata 1500 
ctgttggggg ttctgccgcg cagaccaccg ggcgcctcac cagcttattt gacatgggcc 1560 
ccaggcagaa aatccagctc gttaacacca atggcagctg gcacatcaac cgcaccgccc 1620 
tgaactgcaa tgactccttg cacaccggct ttatcgcgtc tctgttctac acccacagct 1680 
tcaactcgtc aggatgtccc gaacgcatgt ccgcctgccg cagtatcgag gccttccggg 1740 
tgggatgggg cgccttgcaa tatgaggata atgtcaccaa tccagaggat atgagaccct 1800 
attgctggca ctacccacca aggcagtgtg gcgtggtctc cgcgaagact gtgtgtggcc 1860 
cagtgtactg tttcaccccc agcccagtgg tagtgggcac gaccgacagg cttggagcgc 192 0 
ccacttacac gtggggggag aatgagacag atgtcttcct attgaacagc actcgaccac 1980 
cgctggggtc atggttcggc tgcacgtgga tgaactcttc tggctacacc aagacttgcg 204 0 
gcgcaccacc ctgccgtact agagctgact tcaacgccag cacggacctg ttgtgcccca 2100 
cggactgttt taggaagcat cctgatacca cttacctcaa atgcggctct gggccctggc 2160 
tcacgccaag gtgcctgatc gactacccct acaggctctg gcattacccc tgcacagtta 2220 
actataccat cttcaaaata aggatgtatg tgggaggggt tgagcacagg ctcacggctg 2280 
catgcaattt cactcgtggg gatcgttgca acttggagga cagagacaga agtcaactgt 234 0 
ctcctttgtt gcactccacc acggaatggg ccattttacc ttgctcttac tcggacctgc 2400 
ccgccttgtc gactggtctt ctccacctcc accaaaacat cgtggacgta caattcatgt 2460 
atggcctatc acctgccctc acaaaataca tcgtccgatg ggagtgggta atactcttat 252 0 
tcctgctctt agcggacgcc agggtttgcg cctgcttatg gatgctcatc ttgttgggcc 2580 
aggccgaagc agctttggag aacctcgtaa tactcaatgc agcatccctg gccgggacgc 2640 
acggtcttgt gtccttcctc gtgttcttct gctttgcgtg gtatctgaag ggtaggtggg 2700 
tgcccggagc ggtctacgcc ctctacggga tgtggcctct cctcctgctc ctgctggcgt 2760 
tgcctcagcg ggcatatgca ctggacacgg aggtggccgc gtcgtgtggc ggcgttgttc 2820 
ttgtcgggtt aatggcgctg actctgtcgc catattacaa gcgctatatc agctggtgca 2 8 80 
tgtggtggct tcagtatttt ctgaccagag tagaagcgca actgcacgtg tgggttcccc 294 0 
ccctcaacgt ccgggggggg cgcgatgccg tcatcttact catgtgtgta gtacacccga 3000 
ccctggtatt tgacatcacc aaactactcc tggccatctt cggacccctt tggattcttc 3060 
aagccagttt gcttaaagtc ccctacttcg tgcgcgttca aggccttctc cggatctgcg 3120 
cgctagcgcg gaagatagcc ggaggtcatt acgtgcaaat ggccatcatc aagttagggg 3180 
cgcttactgg cacctatgtg tataaccatc tcacccctct tcgagactgg gcgcacaacg 3240 
gcctgcgaga tctggccgtg gctgtggaac cagtcgtctt ctcccgaatg gagaccaagc 33 GO 
tcatcacgtg gggggcagat accgccgcgt gcggtgacat catcaacggc ttgcccgtct 3360 
ctgcccgtag gggccaggag atactgcttg ggccagccga cggaatggtc tccaaggggt 342 0 
ggaggttgct ggcgcccatc acggcgtacg cccagcagac gagaggcctc ctagggtgta 3480 
taatcaccag cctgactggc cgggacaaaa accaagtgga gggtgaggtc cagatcgtgt 3540 
caactgctac ccaaaccttc ctggcaacgt gcatcaatgg ggtatgctgg actgtctacc 3600 
acggggccgg aacgaggacc atcgcatcac ccaagggtcc tgtcatccag atgtatacca 3660 
atgtggacca agaccttgtg ggctggcccg ctcctcaagg ttcccgctca ttgacaccct 3720 
gtacctgcgg ctcctcggac ctttacctgg tcacgaggca cgccgatgtc attcccgtgc 3780 
gccggcgagg tgatagcagg ggtagcctgc tttcgccccg gcccatttcc tacttgaaag 3840 
gctcctcggg gggtccgctg ttgtgccccg cgggacacgc cgtgggccta ttcagggccg 3 900 
cggtgtgcac ccgtggagtg gctaaagcgg tggactttat ccctgtggag aacctaggga 3 960 



33 



WO 00/75338 ^^PCT/USOO/15446 

caaccatgag atccccggtg ttcacggaca actcctctcc accagcagtg ccccagagct 402 0 
tccaggtggc ccacctgcat gctcccaccg gcagcggtaa gagcaccaag gtcccggctg 4080 
cgtacgcagc ccagggctac aaggtgttgg tgctcaaccc ctctgttgct gcaacgctgg 4140 
gctttggtgc ttacatgtcc aaggcccatg gggttgatcc taatatcagg accggggtga 42 00 
gaacaattac cactggcagc cccatcacgt actccaccta cggcaagttc cttgccgacg 4260 
gcgggtgctc aggaggtgct tatgacataa taatttgtga cgagtgccac tccacggatg 4 320 
ccacatccat cttgggcatc ggcactgtcc ttgaccaagc agagactgcg ggggcgagac 43 80 
tggttgtgct cgccactgct acccctccgg gctccgtcac tgtgtcccat cctaacatcg 4440 
a ggaggttgc tctgtccacc accggagaga tcccctttta cggcaaggct atccccctcg 4500 
aggtgatcaa ggggggaaga catctcatct tctgccactc aaagaagaag tgcgacgagc 4560 
tcgccgcgaa gctggtcgca ttgggcatca atgccgtggc ctactaccgc ggtcttgacg 4 620 
tgtctgtcat cccgaccagc ggcgatgttg tcgtcgtgtc gaccgatgct ctcatgactg 4680 
gctttaccgg cgacttcgac tctgtgatag actgcaacac gtgtgtcact cagacagtcg 474 0 
atttcagcct tgaccctacc tttaccattg agacaaccac gctcccccag gatgctgtct 4 800 
ccaggactca acgccggggc aggactggca gggggaagcc aggcatctat agatttgtgg 4 860 
caccggggga gcgcccctcc ggcatgttcg actcgtccgt cctctgtgag tgctatgacg 492 0 
cgggctgtgc ttggtatgag ctcacgcccg ccgagactac agttaggcta cgagcgtaca 4 98 0 
tgaacacccc ggggcttccc gtgtgccagg accatcttga attttgggag ggcgtcttta 5040 
cgggcctcac tcatatagat gcccactttt tatcccagac aaagcagagt ggggagaact 5100 
ttccttacct ggtagcgtac caagccaccg tgtgcgctag ggctcaagcc cctcccccat 5160 
cgtgggacca gatgtggaag tgtttgatcc gccttaaacc caccctccat gggccaacac 5220 
ccctgctata cagactgggc gctgttcaga atgaagtcac cctgacgcac ccaatcacca 52 80 
aatacatcat gacatgcatg tcggccgacc tggaggtcgt cacgagcacc tgggtgctcg 534 0 
ttggcggcgt cctggctgct ctggccgcgt attgcctgtc aacaggctgc gtggtcatag 54 00 
tgggcaggat cgtcttgtcc gggaagccgg caattatacc tgacagggag gttctctacc 54 60 
aggagttcga tgagatggaa gagtgctctc agcacttacc gtacatcgag caagggatga 552 0 
tgctcgctga gcagttcaag cagaaggccc tcggcctcct gcagaccgcg tcccgccatg 5580 
cagaggttat cacccctgct gtccagacca actggcagaa actcgaggtc ttttgggcga 564 0 
agcacatgtg gaatttcatc agtgggatac aatacttggc gggcctgtca acgctgcctg 5700 
gtaaccccgc cattgcttca ttgatggctt ttacagctgc cgtcaccagc ccactaacca 5760 
ctggccaaac cctcctcttc aacatattgg gggggtgggt ggctgcccag ctcgccgccc 582 0 
ccggtgccgc tactgccttt gtgggtgctg gcctagctgg cgccgccatc ggcagcgttg 5880 
gactggggaa ggtcctcgtg gacattcttg cagggtatgg cgcgggcgtg gcgggagctc 5940 
ttgtagcatt caagatcatg agcggtgagg tcccctccac ggaggacctg gtcaatctgc 6000 
tgcccgccat cctctcgcct ggagcccttg tagtcggtgt ggtctgcgca gcaatactgc 6060 
gccggcacgt tggcccgggc gagggggcag tgcaatggat gaaccggcta atagccttcg 6120 
cctcccgggg gaaccatgtt tcccccacgc actacgtgcc ggagagcgat gcagccgccc 6180 
gcgtcactgc catactcagc agcctcactg taacccagct cctgaggcga ctgcatcagt 6240 
ggataagctc ggagtgtacc actccatgct ccggttcctg gctaagggac atctgggact 63 00 
ggatatgcga ggtgctgagc gactttaaga cctggctgaa agccaagctc atgccacaac 6360 
tgcctgggat tccctttgtg tcctgccagc gcgggtatag gggggtctgg cgaggagacg 6420 
gcattatgca cactcgctgc cactgtggag ctgagatcac tggacatgtc aaaaacggga 64 80 
cgatgaggat cgtcggtcct aggacctgca ggaacatgtg gagtgggacg ttccccatta 654 0 * 
acgcctacac cacgggcccc tgtactcccc ttcctgcgcc gaactataag ttcgcgctgt 6600 
ggagggtgtc tgcagaggaa tacgtggaga taaggcgggt gggggacttc cactacgtat 6660 
cgggtatgac tactgacaat cttaaatgcc cgtgccagat cccatcgccc gaatttttca 6720 
cagaattgga cggggtgcgc ctacacaggt ttgcgccccc ttgcaagccc ttgctgcggg 6780 
a 99 a 99t a tc attcagagta ggactccacg agtacccggt ggggtcgcaa ttaccttgcg 6840 
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agcccgaacc ggacgtagcc gtgttgacgt 
cagaggcggc cgggagaagg ttggcgagag 
ctagccagct gtccgctcca tctctcaagg 
acgccgagct catagaggct aacctcctgt 
gggttgagtc agagaacaaa gtggtgattc 
aggatgagcg ggaggtctcc gtacctgcag 
gggccctgcc cgtctgggcg cggccggact 
agcctgacta cgaaccacct gtggtccatg 
ctgtgcctcc gcctcggaaa aagcgtacgg 
ccttggccga gcttgccacc aaaagttttg 
acaatacgac aacatcctct gagcccgccc 
agtcctattc ttccatgccc cccctggagg 
ggtcatggtc gacggtcagt agtggggccg 
cttattcctg gacaggcgca ctcgtcaccc 
tcaacgcact gagcaactcg ttgctacgcc 
gcagtgcttg ccaaaggcag aagaaagtca 
attaccagga cgtgctcaag gaggtcaaag 
tatccgtaga ggaagcttgc agcctgacgc 
atggggcaaa agacgtccgt tgccatgcca 
ggaaagacct tctggaagac agtgtaacac 
aggttttctg cgttcagcct gagaaggggg 
ccgacctggg cgtgcgcgtg tgcgagaaga 
ccctggccgt gatgggaagc tcctacggat 
tcctcgtgca agcgtggaag tccaagaaga 
gttttgactc cacagtcact gagagcgaca 
gtgacctgga cccccaagcc cgcgtggcca 
ggggccctct taccaattca aggggggaaa 
gcgtactgac aactagctgt ggtaacaccc 
gtcgagccgc agggctccag gactgcacca 
tctgtgaaag tgcgggggtc caggaggacg 
tgaccaggta ctccgccccc cccggggacc 
taacatcatg ctcctccaac gtgtcagtcg 
accttacccg tgaccctaca acccccctcg 
ctccagtcaa ttcctggcta ggcaacataa 
tgatactgat gacccatttc tttagcgtcc 
ttaactgtga gatctacgga gcctgctact 
ttcaaagact ccatggcctc agcgcatttt 
atagggtggc cgcatgcctc agaaaacttg 
gggcccggag cgtccgcgct aggcttctgt 
agtacctctt caactgggca gtaagaacaa 
gccggctgga cttgtccggt tggttcacgg 
gcgtgtctca tgcccggccc cgctggttct 
taggcatcta cctcctcccc aaccgatgaa 
catttcctgt tttttttttt tttttttttt 
ttcttttttt cctttctttt tcccttcttt 
gctagctgtg aaaggtccgt gagccgcatg 
gcagatcatg t 



ccatgctcac tgatccctcc catataacag 6900 
ggtcaccccc ttctatggcc agctcctcgg 6960 
caacttgcac cgccaaccat gactcccctg 7020 
ggaggcagga gatgggcggc aacatcacca 7080 
tggactcctt cgatccgctt gtggcagagg 7140 
aaattctgcg gaagtctcgg agattcgccc 7200 
acaacccccc gctagtagag acgtggaaaa 7260 
gctgcccgct accacctcca cggtcccctc 7320 
tggtcctcac cgaatcaacc ctatctactg 7380 
gcagctcctc aacttccggc attacgggcg 7440 
cttctggctg cccccccgac tccgacgttg 7500 
gggagcctgg ggatccggat ctcagcgacg 7560 
acacggaaga tgtcgtgtgc tgctcaatgt 7620 
cgtgcgctgc ggaagaacaa aaactgccca 7680 
atcacaatct ggtgtattcc accacttcac 7740 
catttgacag actgcaagtt ctggacagcc 7800 
cagcggcgtc aaaagtgaag gctaacttgc 7860 
ccccacattc agccaaatcc aagtttggct 7920 
gaaaggccgt agcccacatc aactccgtgt 7980 
caatagacac taccatcatg gccaagaacg 804 0 
gtcgtaagcc agctcgtctc atcgtgttcc 8100 
tggccctgta cgacgtggtt agcaagctcc 8160 
tccaatactc accaggacag cgggttgaat 8220 
ccccgatggg gttctcgtat gatacccgct 82 80 
tccgtacgga ggaggcaatt taccaatgtt 834 0 
tcaagtccct cactgagagg ctttatgttg 8400 
actgcggcta ccgcaggtgc cgcgcgagcg 8460 
tcacttgcta catcaaggcc cgggcagcct 8520 
tgctcgtgtg tggcgacgac ttagtcgtta 8580 
cggcgagcct gagagccttc acggaggcta 864 0 
ccccacaacc agaatacgac ttggagctta 8700 
cccacgacgg cgctggaaag agggtctact 8760 
cgagagccgc gtgggagaca gcaagacaca 882 0 
tcatgtttgc ccccacactg tgggcgagga 8880 
tcatagccag ggatcagctt gaacaggctc 8940 
ccatagaacc actggatcta cctccaatca 9000 
cactccacag ttactctcca ggtgaaatca 9060 
gggtcccgcc cttgcgagct tggagacacc 9120 
ccagaggagg cagggctgct atatgtggca 9180 
agctcaaact cactccaata gcggccgctg 9240 
ctggctacag cgggggagac atttatcaca 9300 
ggttttgcct actcctgctc gctgcagggg 9360 
ggttggggta aacactccgg cctcttaagc 9420 
tttttttctt tttttttttc tttcctttcc 9480 
aatggtggct ccatcttagc cctagtcacg 954 0 
actgcagaga gtgctgatac tggcctctct 9600 

9611 
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<210> 6 
<211> 3015 
<212> PRT 

<213> Hepatitis C virus 
<400> 6 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

lie Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val lie Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Val Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys lie Thr Thr Pro Val Ser Ala Ala 
180 185 190 

Glu Val Lys Asn lie Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr 
195 200 205 

Asn Asp Ser lie Thr Trp Gin -Leu Gin Ala Ala Val Leu His Val Pro 
210 215 220 
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Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gin Cys Trp lie 
225 230 235 240 

Pro Val Ser Pro Asn Val Ala Val Gin Arg Pro Gly Ala Leu Thr Gin 
245 250 255 

Gly Leu Arg Thr His lie Asp Met Val Val Met Ser Ala Thr Leu Cys 
260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 
275 280 285 

Gin Met Phe lie Val Ser Pro Gin His His Trp Phe Val Gin Asp Cys 
290 295 300 

Asn Cys Ser lie Tyr Pro Gly Thr lie Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met lie Leu Ala Tyr 
325 330 335 

Ala Met Arg Val Pro Glu Val lie lie Asp lie lie Ser Gly Ala His 
340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp 
355 360 365 

Ala Lys Val Val Val lie Leu Leu Leu Ala Ala Gly Val Asp Ala Arg 
370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gin Thr Thr Gly Arg Leu Thr 
385 390 395 400 

Ser Leu Phe Asp Met Gly Pro Arg Gin Lys lie Gin Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu His Thr Gly Phe He Ala Ser Leu Phe Tyr Thr His Ser Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser He Glu Ala 
450 455 460 

Phe Arg Val Gly Trp Gly Ala Leu Gin Tyr Glu Asp Asn Val Thr Asn 
465 470 475 480 
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Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gin Cys 
485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 
595 600 605 

Pro Arg Cys Leu lie Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
610 615 620 

Thr Val Asn Tyr Thr lie Phe Lys lie Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 
645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gin Leu Ser Pro Leu Leu His Ser 
660 665 670 

Thr Thr Glu Trp Ala lie Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala 
675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gin Asn lie Val Asp Val Gin 
690 695 700 

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr He Val Arg Trp 
705 710 715 720 

Glu Trp Val He Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
725 730 735 
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Ala Cys Leu Trp Met Leu lie Leu Leu Gly Gin Ala Glu Ala Ala Leu 
740 745 750 

Glu Asn Leu Val lie Leu Asn Ala Ala Ser Leu Ala Gly Thr His Gly 
755 760 765 

Leu Val Ser Phe Leu Val Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly 
770 775 780 

Arg Trp Val Pro Gly Ala Val Tyr Ala Leu Tyr Gly Met Trp Pro Leu 
785 790 795 800 

Leu Leu Leu Leu Leu Ala Leu Pro Gin Arg Ala Tyr Ala Leu Asp Thr 
805 810 815 

Glu Val Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly Leu Met Ala 
820 825 830 

Leu Thr Leu Ser Pro Tyr Tyr Lys Arg Tyr lie Ser Trp Cys Met Trp 
835 840 845 

Trp Leu Gin Tyr Phe Leu Thr Arg Val Glu Ala Gin Leu His Val Trp 
850 855 860 

Val Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Val lie Leu Leu 
865 870 875 880 

Met Cys Val Val His Pro Thr Leu Val Phe Asp lie Thr Lys Leu Leu 
885 890 895 

Leu Ala lie Phe Gly Pro Leu Trp lie Leu Gin Ala Ser Leu Leu Lys 
900 905 910 

Val Pro Tyr Phe Val Arg Val Gin Gly Leu Leu Arg life Cys Ala Leu 
915 920 925 

Ala Arg Lys lie Ala Gly Gly His Tyr Val Gin Met Ala He He Lys 
930 935 940 

Leu Gly Ala Leu Thr Gly Thr Tyr Val Tyr Asn His Leu Thr Pro Leu 
945 950 955 960 

Arg Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 

Pro Val Val Phe Ser Arg Met Glu Thr Lys Leu He Thr Trp Gly Ala 
980 985 990 
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Asp Thr Ala Ala Cys Gly Asp lie lie Asn Gly Leu Pro Val Ser Ala 
995 1000 1005 

Arg Arg Gly Gin Glu lie Leu Leu Gly Pro Ala Asp Gly Met Val Ser 
1010 1015 1020 

Lys Gly Trp Arg Leu Leu Ala Pro lie Thr Ala Tyr Ala Gin Gin Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Cys lie lie Thr Ser Leu Thr Gly Arg Asp Lys 
1045 1050 1055 

Asn Gin Val Glu Gly Glu Val Gin lie Val Ser Thr Ala Thr Gin Thr 
1060 1065 1070 

Phe Leu Ala Thr Cys lie Asn Gly Val Cys Trp Thr Val Tyr His Gly 
1075 1080 1085 

Ala Gly Thr Arg Thr lie Ala Ser Pro Lys Gly Pro Val lie Gin Met 
1090 1095 1100 

Tyr Thr Asn Val Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly 
1105 1110 1115 1120 

Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg His Ala Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser 
1140 1145 1150 

Arg Gly Ser Leu Leu Ser Pro Arg Pro lie Ser Tyr Leu Lys Gly Ser 
1155 1160 1165 

Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala val Gly Leu Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe lie 
1185 1190 1195 1200 

Pro Val Glu Asn Leu Gly Thr Thr Met Arg Ser Pro Val Phe Thr Asp 
1205 1210 1215 

Asn Ser Ser Pro Pro Ala Val Pro Gin Ser Phe Gin Val Ala His Leu 
1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr 
1235 1240 1245 
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Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp Pro 
1265 1270 1275 1280 

Asn lie Arg Thr Gly Val Arg Thr lie Thr Thr Gly Ser Pro lie Thr 
1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly 
1300 1305 1310 

Ala Tyr Asp lie lie lie Cys Asp Glu Cys His Ser Thr Asp Ala Thr 
1315 1320 1325 

Ser lie Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 
1330 1335 1340 

Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Val Ser His Pro Asn He Glu Glu Val Ala Leu Ser Thr Thr Gly Glu 
1365 1370 1375 

He Pro Phe Tyr Gly Lys Ala He Pro Leu Glu Val He Lys Gly Gly 
1380 1385 1390 



Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 
1395 1400 1405 

Ala Lys Leu Val Ala Leu Gly He Asn Ala Val Ala Tyr Tyr Arg Gly 
1410 1415 1420 

Leu Asp Val Ser Val He Pro Thr Ser Gly Asp Val Val Val Val Ser 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val He 
1445 1450 1455 

Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro 
1460 1465 1470 

Thr Phe Thr He Glu Thr Thr Thr Leu Pro Gin Asp Ala Val Ser Arg 
1475 1480 1485 

Thr Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro Gly He Tyr Arg 
1490 1495 1500 
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Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro 
1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn Thr Pro Gly Leu 
1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr Gly 
1555 1560 1565 

Leu Thr His lie Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly 
1570 1575 1580 

Glu Asn Phe Pro Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Gin Ala Pro Pro Pro Ser Trp Asp Gin Met Trp Lys Cys Leu lie 
1605 1610 1615 

Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
1620 1625 1630 

Gly Ala Val Gin Asn Glu Val Thr Leu Thr His Pro lie Thr Lys Tyr 
1635 1640 1645 

lie Met Thr Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp 
1650 1655 1660 

Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser 
1665 1670 1675 1680 

Thr Gly Cys Val Val lie Val Gly Arg lie Val Leu Ser Gly Lys Pro 
1685 1690 1695 

Ala lie lie Pro Asp Arg Glu Val Leu Tyr Gin Glu Phe Asp Glu Met 
1700 1705 1710 

Glu Glu Cys Ser Gin His Leu Pro Tyr lie Glu Gin Gly Met Met Leu 
1715 1720 1725 

Ala Glu Gin Phe Lys Gin Lys Ala Leu Gly Leu Leu Gin Thr Ala Ser 
1730 1735 1740 

Arg His Ala Glu Val He Thr Pro Ala Val Gin Thr Asn Trp Gin Lys 
1745 1750 1755 1760 
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Leu Glu Val Phe Trp Ala Lys His Met Trp Asn Phe He Ser Gly He 
1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala He Ala 
1780 1785 1790 

Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Gly 
1795 1800 1805 

Gin Thr Leu Leu Phe Asn He Leu Gly Gly Trp Val Ala Ala Gin Leu 
1810 1815 1820 

Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala Gly Leu Ala Gly 
1825 1830 1835 1840 

Ala Ala He Gly Ser Val Gly Leu Gly Lys Val Leu Val Asp He Leu 
1845 1850 1855 

Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys He 
1860 1865 1870 

Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro 
1875 1880 1885 

Ala He Leu Ser Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala 
1890 1895 1900 

He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 1910 1915 1920 

Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr 
1925 1930 1935 

His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val Thr Ala He Leu 
1940 1945 1950 

Ser Ser Leu Thr Val Thr Gin Leu Leu Arg Arg Leu His Gin Trp He 
1955 1960 1965 

Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp He 
1970 1975 1980 

Trp Asp Trp He Cys Glu Val Leu Ser Asp Phe Lys Thr Trp Leu Lys 
1985 1990 1995 2000 

Ala Lys Leu Met Pro Gin Leu Pro Gly He Pro Phe Val Ser Cys Gin 
2005 2010 2015 
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Arg Gly Tyr Arg Gly Val Trp Arg Gly Asp Gly lie Met His Thr Arg 
2020 2025 2030 

Cys His Cys Gly Ala Glu lie Thr Gly His Val Lys Asn Gly Thr Met 
2035 2040 2045 

Arg lie Val Gly Pro Arg Thr Cys Arg Asn Met Trp Ser Gly Thr Phe 
2050 2055 2060 

Pro lie Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Leu Pro Ala Pro 
2065 2070 2075 2080 

Asn Tyr Lys Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr Val Glu 
2085 2090 2095 

lie Arg Arg Val Gly Asp Phe His Tyr Val Ser Gly Met Thr Thr Asp 
2100 2105 2110 

Asn Leu Lys Cys Pro Cys Gin lie Pro Ser Pro Glu Phe Phe Thr Glu 
2115 2120 2125 

Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro Cys Lys Pro Leu 
2130 2135 2140 

Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val 
2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr 
2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His lie Thr Ala Glu Ala Ala Gly Arg 
2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Met Ala Ser Ser Ser Ala Ser 
2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr Ala Asn His Asp 
2210 2215 2220 

Ser Pro Asp Ala Glu Leu lie Glu Ala Asn Leu Leu Trp Arg Gin Glu 
2225 2230 2235 2240 

Met Gly Gly Asn lie Thr Arg Val Glu Ser Glu Asn Lys Val Val lie 
2245 2250 2255 

Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp Glu Arg Glu Val 
2260 2265 2270 
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Ser Val Pro Ala Glu lie Leu Arg Lys Ser Arg Arg Phe Ala Arg Ala 
2275 2280 2285 

Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Thr 
2290 2295 2300 

Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His Gly Cys Pro Leu 
2305 2310 2315 2320 

Pro Pro Pro Arg Ser Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr 
2325 2330 2335 

Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu Ala Glu Leu Ala 
2340 2345 2350 

Thr Lys Ser Phe Gly Ser Ser Ser Thr Ser Gly lie Thr Gly Asp Asn 
2355 2360 2365 

Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys Pro Pro Asp Ser 
2370 2375 2380 

Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser Ser Gly Ala 
2405 2410 2415 

Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly 
2420 2425 2430 

Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gin Lys Leu Pro lie Asn 
2435 2440 2445 

Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr 
2450 2455 2460 

Thr Ser Arg Ser Ala Cys Gin Arg Gin Lys Lys Val Thr Phe Asp Arg 
2465 2470 2475 2480 

Leu Gin Val Leu Asp Ser His Tyr Gin Asp Val Leu Lys Glu Val Lys 
2485 2490 2495 

Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala 
2500 2505 2510 

Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly 
2515 2520 2525 
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Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val Ala His lie Asn 
2530 2535 2540 

Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Val Thr Pro lie Asp Thr 
2545 2550 2555 2560 

Thr lie Met Ala Lys Asn Glu Val Phe Cys Val Gin Pro Glu Lys Gly 
2565 2570 2575 

Gly Arg Lys Pro Ala Arg Leu lie Val Phe Pro Asp Leu Gly Val Arg 
2580 2585 2590 

Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Ser Lys Leu Pro Leu 
2595 2600 2605 



Ala Val Met Gly Ser Ser Tyr Gly 
2610 2615 

Val Glu Phe Leu Val Gin Ala Trp 
2625 2630 

Phe Ser Tyr Asp Thr Arg Cys Phe 
2645 



Phe Gin Tyr Ser Pro Gly Gin Arg 
2620 

Lys Ser Lys Lys Thr Pro Met Gly 
2635 2640 

Asp Ser Thr Val Thr Glu Ser Asp 
2650 2655 



lie Arg Thr Glu Glu Ala lie Tyr Gin Cys Cys Asp Leu Asp Pro Gin 
2660 2665 2670 

Ala Arg Val Ala lie Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly 
2675 2680 2685 

Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg 
2690 2695 2700 

Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr 
2705 2710 2715 2720 

lie Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gin Asp Cys Thr 
2725 2730 2735 

Met Leu Val Cys Gly Asp Asp Leu Val Val He Cys Glu Ser Ala Gly 
2740 2745 2750 

Val Gin Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr 
2755 2760 2765 

Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr Asp Leu 
2770 2775 2780 
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Glu Leu lie Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly 
2785 2790 2795 2800 

Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu 
2805 2810 2815 

Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp 
2820 2825 2830 

Leu Gly Asn lie lie Met Phe Ala Pro Thr Leu Trp Ala Arg Met lie 
2835 2840 2845 

Leu Met Thr His Phe Phe Ser Val Leu lie Ala Arg Asp Gin Leu Glu 
2850 2855 2860 

Gin Ala Leu Asn Cys Glu lie Tyr Gly Ala Cys Tyr Ser lie Glu Pro 
2865 2870 2875 2880 

Leu Asp Leu Pro Pro lie lie Gin Arg Leu His Gly Leu Ser Ala Phe 
2885 2890 2895 

Ser Leu His Ser Tyr Ser Pro Gly Glu He Asn Arg Val Ala Ala Cys 
2900 2905 2910 

Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala 
2915 2920 2925 

Arg Ser Val Arg Ala Arg Leu Leu Ser Arg Gly Gly Arg Ala Ala He 
2930 2935 2940 

Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr Lys Leu Lys Leu 
2945 2950 2955 2960 

Thr Pro He Ala Ala Ala Gly Arg Leu Asp Leu Ser Gly Trp Phe Thr 
2965 2970 2975 

Ala Gly Tyr Ser Gly Gly Asp He Tyr His Ser Val Ser His Ala Arg 
2980 2985 2990 

Pro Arg Trp Phe Trp Phe Cys Leu Leu Leu Leu Ala Ala Gly Val Gly 
2995 3000 3005 

He Tyr Leu Leu Pro Asn Arg 
3010 3015 



<210> 7 
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<211> 9611 
<212> DNA 

<213> Hepatitis C virus 
<400> 7 

gccagccccc tgatgggggc gacactccac catgaatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gataaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcaagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcaca aatcctaaac 360 
ctcaaagaaa aaccaaaaga aacaccaacc gtcgcccaca agacgttaag tttccgggcg 420 
gcggccagat cgttggcgga gtatacttgt tgccgcgcag gggccccagg ttgggtgtgc 480 
gcgcgacaag gaagacttcg gagcggtccc agccacgtgg aaggcgccag cccatcccta 540 
aagatcggcg ctccactggc aaatcctggg gaaaaccagg atacccctgg cccctatacg 600 
ggaatgaggg actcggctgg gcaggatggc tcctgtcccc ccgaggttcc cgtccctctt 660 
ggggccccaa tgacccccgg cataggtcgc gcaacgtggg taaggtcatc gataccctaa 72 0 
cgtgcggctt tgccgacctc atggggtaca tccctgtcgt gggcgccccg ctcggcggcg 780 
tcgccagagc tctcgcgcat ggcgtgagag tcctggagga cggggttaat tttgcaacag 84 0 
ggaacttacc cggttgctcc ttttctatct tcttgctggc cctgctgtcc tgcatcacca 900 
ccccggtctc cgctgccgaa gtgaagaaca tcagtaccgg ctacatggtg actaacgact 960 
gcaccaatga cagcattacc tggcagctcc aggctgctgt cctccacgtc cccgggtgcg 1020 
tcccgtgcga gaaagtgggg aatgcatctc agtgctggat accggtctca ccgaatgtgg 1080 
ccgtgcagcg gcccggcgcc ctcacgcagg gcttgcggac gcacatcgac atggttgtga 1140 
tgtccgccac gctctgctct gccctctacg tgggggacct ctgcggtggg gtgatgctcg 1200 
cagcccaaat gttcattgtc tcgccgcagc accactggtt tgtccaagac tgcaattgct 1260 
ccatctaccc tggtaccatc * actggacacc gcatggcatg ggacatgatg atgaactggt 1320 
cgcccacggc taccatgatc ttggcgtacg cgatgcgtgt ccccgaggtc attatagaca 13 80 
tcattagcgg ggctcattgg ggcgtcatgt tcggcttggc ctacttctct atgcagggag 1440 
cgtgggcgaa agtcgttgtc atccttctgt tggccgccgg ggtggacgcg cgcacccata 1500 
ctgttggggg ttctgccgcg cagaccaccg ggcgcctcac cagcttattt gacatgggcc 1560 
ccaggcagaa aatccagctc gttaacacca atggcagctg gcacatcaac cgcaccgccc 1620 
tgaactgcaa tgactccttg cacaccggct ttatcgcgtc tctgttctac acccacagct 1680 
tcaactcgtc aggatgtccc gaacgcatgt ccgcctgccg cagtatcgag gccttccggg 1740 
tgggatgggg cgccttgcaa tatgaggata atgtcaccaa tccagaggat atgagaccct 1800 
attgctggca ctacccacca aggcagtgtg gcgtggtctc cgcgaagact gtgtgtggcc 1860 
cagtgtactg tttcaccccc agcccagtgg tagtgggcac gaccgacagg cttggagcgc 1920 
ccacttacac gtggggggag aatgagacag atgtcttcct attgaacagc actcgaccac 1980 
cgctggggtc atggttcggc tgcacgtgga tgaactcttc tggctacacc aagacttgcg 2 040 
gcgcaccacc ctgccgtact agagctgact tcaacgccag cacggacctg ttgtgcccca 2100 
cggactgttt taggaagcat cctgatacca cttacctcaa atgcggctct gggccctggc 2160 
tcacgccaag gtgcctgatc gactacccct acaggctctg gcattacccc tgcacagtta 2220 
actataccat cttcaaaata aggatgtatg tgggaggggt tgagcacagg ctcacggctg 2280 
catgcaattt cactcgtggg gatcgttgca acttggagga cagagacaga agtcaactgt 2 34 0 
ctcctttgtt gcactccacc acggaatggg ccattttacc ttgctcttac tcggacctgc 24 00 
ccgccttgtc gactggtctt ctccacctcc accaaaacat cgtggacgta caattcatgt 24 60 
atggcctatc acctgccctc acaaaataca tcgtccgatg ggagtgggta atactcttat 2520 
tcctgctctt agcggacgcc agggtttgcg cctgcttatg gatgctcatc ttgttgggcc 2580 
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aggccgaagc agcactagag aagctggtca tcttgcacgc tgcgagcgca gctagctgca 2640 
atggcttcct atattttgtc atctttttcg tggctgcttg gtacatcaag ggtcgggtag 2700 
tccccttagc tacctattcc ctcactggcc tgtggtcctt tagcctactg ctcctagcat 2760 
tgccccaaca ggcatatgca ctggacacgg aggtggccgc gtcgtgtggc ggcgttgttc 2820 
ttgtcgggtt aatggcgctg actctgtcgc catattacaa gcgctatatc agctggtgca 2880 
tgtggtggct tcagtatttt ctgaccagag tagaagcgca actgcacgtg tgggttcccc 2940 
ccctcaacgt ccgggggggg cgcgatgccg t cat ct tact catgtgtgta gtacacccga 3000 
ccctggtatt tgacatcacc aaactactcc tggccatctt cggacccctt tggattcttc 3060 
aagccagttt gcttaaagtc ccctacttcg tgcgcgttca aggccttctc cggatctgcg 3120 
cgctagcgcg gaagatagcc ggaggtcatt acgtgcaaat ggccatcatc aagttagggg 3180 
cgcttactgg cacctatgtg tataaccatc tcacccctct tcgagactgg gcgcacaacg 3240 
gcctgcgaga tctggccgtg gctgtggaac cagtcgtctt ctcccgaatg gagaccaagc 3300 
tcatcacgtg gggggcagat accgccgcgt gcggtgacat catcaacggc ttgcccgtct 3360 
ctgcccgtag gggccaggag atactgcttg ggccagccga cggaatggtc tccaaggggt 3420 
ggaggttgct ggcgcccatc acggcgtacg cccagcagac gagaggcctc ctagggtgta 34 80 
taatcaccag cctgactggc cgggacaaaa accaagtgga gggtgaggtc cagatcgtgt 3 540 
caactgctac ccaaaccttc ctggcaacgt gcatcaatgg ggtatgctgg actgtctacc 3600 
acggggccgg aacgaggacc atcgcatcac ccaagggtcc tgtcatccag atgtatacca 3660 
atgtggacca agaccttgtg ggctggcccg ctcctcaagg ttcccgctca ttgacaccct 3720 
gtacctgcgg ctcctcggac ctttacctgg tcacgaggca cgccgatgtc attcccgtgc 3780 
gccggcgagg tgatagcagg ggtagcctgc tttcgccccg gcccatttcc tactcgaaag 3 84 0 
gctcctcggg gggtccgctg ttgtgccccg cgggacacgc cgtgggccta ttcagggccg 3900 
cggtgtgcac ccgtggagtg gctaaagcgg tggactttat ccctgtggag aacctaggga 3960 
caaccatgag atccccggtg ttcacggaca actcctctcc accagcagtg ccccagagct 4 020 
tccaggtggc ccacctgcat gctcccaccg gcagcggtaa gagcaccaag gtcccggctg 4080 
cgtacgcagc ccagggctac aaggtgttgg tgctcaaccc ctctgttgct gcaacgctgg 4140 
gctttggtgc ttacatgtcc aaggcccatg gggttgatcc taatatcagg accggggtga 4200 
gaacaattac cactggcagc cccatcacgt actccaccta cggcaagttc cttgccgacg 4260 
gcgggtgctc aggaggtgct tatgacataa taatttgtga cgagtgccac tccacggatg 4320 
ccacatccat cttgggcatc ggcactgtcc ttgaccaagc agagactgcg ggggcgagac 4380 
tggttgtgct cgccactgct acccctccgg gctccgtcac tgtgtcccat cctaacatcg 4440 
aggaggttgc tctgtccacc accggagaga tcccctttta cggcaaggct atccccctcg 4500 
aggtgatcaa ggggggaaga catctcatct tctgccactc aaagaagaag tgcgacgagc 4560 
tcgccgcgaa gctggtcgca ttgggcatca atgccgtggc ctactaccgc ggtcttgacg 4620 
tgtctgtcat cccgaccagc ggcgatgttg tcgtcgtgtc gaccgatgct ctcatgactg 4680 
gctttaccgg cgacttcgac tctgtgatag ^ctgcaacac gtgtgtcact cagacagtcg 4740 
atttcagcct tgaccctacc tttaccattg agacaaccac gctcccccag gatgctgtct 4 800 
ccaggactca acgccggggc aggactggca gggggaagcc aggcatctat agatttgtgg 4 860 
caccggggga gcgcccctcc ggcatgttcg actcgtccgt cctctgtgag tgctatgacg 4 920 
cgggctgtgc ttggtatgag ctcacgcccg ccgagactac agttaggcta cgagcgtaca 4980 
tgaacacccc ggggcttccc gtgtgccagg accatcttga attttgggag ggcgtcttta 504 0 
cgggcctcac tcatatagat gcccactttt tatcccagac aaagcagagt ggggagaact 5100 
ttccttacct ggtagcgtac caagccaccg tgtgcgctag ggctcaagcc cctcccccat 5160 
cgtgggacca gatgtggaag tgtttgatcc gccttaaacc caccctccat gggccaacac 522 0 
ccctgctata cagactgggc gctgttcaga atgaagtcac cctgacgcac ccaatcacca 5280 
aatacatcat gacatgcatg tcggccgacc tggaggtcgt cacgagcacc tgggtgctcg 534 0 
ttggcggcgt cctggctgct ctggccgcgt attgcctgtc aacaggctgc gtggtcatag 54 00 
tgggcaggat cgtcttgtcc gggaagccgg caattatacc tgacagggag gttctctacc 54 60 
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a 9gagttcga tgagatggaa gagtgctctc 
tgctcgctga gcagttcaag cagaaggccc 
cagaggttat cacccctgct gtccagacca 
agcacatgtg gaatttcatc agtgggatac 
gtaaccccgc cattgcttca ttgatggctt 
ctggccaaac cctcctcttc aacatattgg 
ccggtgccgc tactgccttt gtgggtgctg 
gactggggaa ggtcctcgtg gacattcttg 
ttgtagcatt caagatcatg agcggtgagg 
tgcccgccat cctctcgcct ggagcccttg 
gccggcacgt tggcccgggc gagggggcag 
cctcccgggg gaaccatgtt tcccccacgc 
gcgtcactgc catactcagc agcctcactg 
ggataagctc ggagtgtacc actccatgct 
ggatatgcga ggtgctgagc gactttaaga 
tgcctgggat tccctttgtg tcctgccagc 
gcattatgca cactcgctgc cactgtggag 
cgatgaggat cgtcggtcct aggacctgca 
acgcctacac cacgggcccc tgtactcccc 
ggagggtgtc tgcagaggaa tacgtggaga 
cgggtatgac tactgacaat cttaaatgcc 
cagaattgga cggggtgcgc ctacacaggt 
aggaggtatc attcagagta ggactccacg 
agcccgaacc ggacgtagcc gtgttgacgt 
cagaggcggc cgggagaagg ttggcgagag 
ctagccagct gtccgctcca tctctcaagg 
acgccgagct catagaggct aacctcctgt 
gggttgagtc agagaacaaa gtggtgattc 
aggatgagcg ggaggtctcc gtacctgcag 
gggccctgcc cgtctgggcg cggccggact 
agcctgacta cgaaccacct gtggtccatg 
ctgtgcctcc gcctcggaaa aagcgtacgg 
ccttggccga gcttgccacc aaaagttttg 
acaatacgac aacatcctct gagcccgccc 
agtcctattc ttccatgccc cccctggagg 
ggtcatggtc gacggtcagt agtggggccg 
cttattcctg gacaggcgca ctcgtcaccc 
tcaacgcact gagcaactcg ttgctacgcc 
gcagtgcttg ccaaaggcag aagaaagtca 
attaccagga cgtgctcaag gaggtcaaag 
tatccgtaga ggaagcttgc agcctgacgc 
atggggcaaa agacgtccgt tgccatgcca 
ggaaagacct tctggaagac agtgtaacac 
aggttttctg cgttcagcct gagaaggggg 
ccgacctggg cgtgcgcgtg tgcgagaaga 
ccctggccgt gatgggaagc tcctacggat 
tcctcgtgca agcgtggaag tccaagaaga 
gttttgactc cacagtcact gagagcgaca 



agcacttacc gtacatcgag caagggatga 552 0 
tcggcctcct gcagaccgcg tcccgccatg 5580 
actggcagaa actcgaggtc ttttgggcga 564 0 
aatacttggc gggcctgtca acgctgcctg 5700 
ttacagctgc cgtcaccagc ccactaacca 5760 
gggggtgggt ggctgcccag ctcgccgccc 5820 
gcctagctgg cgccgccatc ggcagcgttg 5880 
cagggtatgg cgcgggcgtg gcgggagctc 594 0 
tcccctccac ggaggacctg gtcaatctgc 6000 
tagtcggtgt ggtctgcgca gcaatactgc 6060 
tgcaatggat gaaccggcta atagccttcg 6120 
actacgtgcc ggagagcgat gcagccgccc 6180 
taacccagct cctgaggcga ctgcatcagt 6240 
ccggttcctg gctaagggac atctgggact 6300 
cctggctgaa agccaagctc atgccacaac 6360 
gcgggtatag gggggtctgg cgaggagacg 6420 
ctgagatcac tggacatgtc aaaaacggga 64 80 
ggaacatgtg gagtgggacg ttccccatta 654 0 
ttcctgcgcc gaactataag ttcgcgctgt 6600 
taaggcgggt gggggacttc cactacgtat 6660 
cgtgccagat cccatcgccc gaatttttca 6720 
ttgcgccccc ttgcaagccc ttgctgcggg 6780 
agtacccggt ggggtcgcaa ttaccttgcg 6840 
ccatgctcac tgatccctcc catataacag 6900 
ggtcaccccc ttctatggcc agctcctcgg 6960 
caacttgcac cgccaaccat gactcccctg 7020 
ggaggcagga gatgggcggc aacatcacca 7080 
tggactcctt cgatccgctt gtggcagagg 714 0 
aaattctgcg gaagtctcgg agattcgccc 7200 
acaacccccc gctagtagag acgtggaaaa 7260 
gctgcccgct accacctcca cggtcccctc 7320 
tggtcctcac cgaatcaacc ctatctactg 73 80 
gcagctcctc aacttccggc attacgggcg 744 0 
cttctggctg cccccccgac tccgacgttg 7500 
gggagcctgg ggatccggat ctcagcgacg 7560 
acacggaaga tgtcgtgtgc tgctcaatgt 7620 
cgtgcgctgc ggaagaacaa aaactgccca 7680 
atcacaatct ggtgtattcc accacttcac 7740 
catttgacag actgcaagtt ctggacagcc 7800 
cagcggcgtc aaaagtgaag gctaacttgc 7860 
ccccacattc agccaaatcc aagtttggct 7 920 
gaaaggccgt agcccacatc aactccgtgt 7980 
caatagacac taccatcatg gccaagaacg 8040 
gtcgtaagcc agctcgtctc atcgtgttcc 8100 
tggccctgta cgacgtggtt agcaagctcc 8160 
tccaatactc accaggacag cgggttgaat 8220 
ccccgatggg gttctcgtat gatacccgct 8280 
tccgtacgga ggaggcaatt taccaatgtt 834 0 



50 



WO 00/75338 ^^PCT/USOO/15446 

gtgacctgga cccccaagcc cgcgtggcca tcaagtccct cactgagagg ctttatgttg 8400 

ggggccctct taccaattca aggggggaaa actgcggcta ccgcaggtgc cgcgcgagcg 84 60 

gcgtactgac aactagctgt ggtaacaccc tcacttgcta catcaaggcc cgggcagcct 8520 

gtcgagccgc agggctccag gactgcacca tgctcgtgtg tggcgacgac ttagtcgtta 8580 

tctgtgaaag tgcgggggtc caggaggacg cggcgagcct gagagccttc acggaggcta 8640 

tgaccaggta ctccgccccc cccggggacc ccccacaacc agaatacgac ttggagctta 8700 

taacatcatg ctcctccaac gtgtcagtcg cccacgacgg cgctggaaag agggtctact 8760 

accttacccg tgaccctaca acccccctcg cgagagccgc gtgggagaca gcaagacaca 8820 

ctccagtcaa ttcctggcta ggcaacataa tcatgtttgc ccccacactg tgggcgagga 8880 

tgatactgat gacccatttc tttagcgtcc tcatagccag ggatcagctt gaacaggctc 8940 

ttaactgtga gatctacgga gcctgctact ccatagaacc actggatcta cctccaatca 9000 

ttcaaagact ccatggcctc agcgcatttt cactccacag ttactctcca ggtgaaatca 9060 

atagggtggc cgcatgcctc agaaaacttg gggtcccgcc cttgcgagct tggagacacc 9120 

gggcccggag cgtccgcgct aggcttctgt ccagaggagg cagggctgct atatgtggca 9180 

agtacctctt caactgggca gtaagaacaa agctcaaact cactccaata gcggccgctg 9240 

gccggctgga cttgtccggt tggttcacgg ctggctacag cgggggagac atttatcaca 9300 

gcgtgtctca tgcccggccc cgctggttct ggttttgcct actcctgctc gctgcagggg 9360 

taggcatcta cctcctcccc aaccgatgaa ggttggggta aacactccgg cctcttaagc 94 20 

catttcctgt tttttttttt tttttttttt tttttttctt tttttttttc tttcctttcc 9480 

ttcttttttt cctttctttt tcccttcttt aatggtggct ccatcttagc cctagtcacg 9540 

gctagctgtg aaaggtccgt gagccgcatg actgcagaga gtgctgatac tggcctctct 9600 

gcagatcatg t 9611 



<210> 8 
<211> 3015 
<212> PRT 

<213> Hepatitis C virus 
<400> 8 

Met Ser Thr Asn Pro Lys Pro Gin Axg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

lie Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 
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Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr He Pro Val Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys He Thr Thr Pro Val Ser Ala Ala 
180 185 190 

Glu Val Lys . Asn He Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr 
195 200 205 

Asn Asp Ser He Thr Trp Gin Leu Gin Ala Ala Val Leu His Val Pro 
210 215 220 

Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gin Cys Trp He 
225 230 235 240 

Pro Val Ser Pro Asn Val Ala Val Gin Arg Pro Gly Ala Leu Thr Gin 
245 250 255 

Gly Leu Arg Thr His He Asp Met Val Val Met Ser Ala Thr Leu Cys 
260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 
275 280 285 

Gin Met Phe He Val Ser Pro Gin His His Trp Phe Val Gin Asp Cys 
290 295 300 

Asn Cys Ser He Tyr Pro Gly Thr He Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met He Leu Ala Tyr 
325 330 335 

Ala Met Arg Val Pro Glu Val He He Asp He He Ser Gly Ala His 
340 345 350 



52 



WO 00/75338 PCT/US00/15446 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp 
355 360 365 

Ala Lys Val Val Val lie Leu Leu Leu Ala Ala Gly Val Asp Ala Arg 
370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gin Thr Thr Gly Arg Leu Thr 
385 390 395 400 

Ser Leu Phe Asp Met Gly Pro Arg Gin Lys lie Gin Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu His Thr Gly Phe lie Ala Ser Leu Phe Tyr Thr His Ser Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser lie Glu Ala 
450 455 460 

Phe Arg Val Gly Trp Gly Ala Leu Gin Tyr Glu Asp Asn Val Thr Asn 
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gin Cys 
485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 
595 600 605 
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Pro Arg Cys Leu He Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
610 615 620 

Thr Val Asn Tyr Thr He Phe Lys He Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 
645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gin Leu Ser Pro Leu Leu His Ser 
660 665 670 

Thr Thr Glu Trp Ala lie Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala 
675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gin Asn He Val Asp Val Gin 
690 695 700 

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr He Val Arg Trp 
705 710 715 720 

Glu Trp Val He Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
725 730 735 

Ala Cys Leu Trp Met Leu He Leu Leu Gly Gin Ala Glu Ala Ala Leu 
740 745 750 

Glu Lys Leu Val He Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly 
755 760 765 

Phe Leu Tyr Phe Val He Phe Phe Val Ala Ala Trp Tyr He Lys Gly 
770 775 780 

Arg Val Val Pro Leu Ala Thr Tyr Ser Leu Thr Gly Leu Trp Ser Phe 
785 790 795 800 

Ser Leu Leu Leu Leu Ala Leu Pro Gin Gin Ala Tyr Ala Leu Asp Thr 
805 810 815 

Glu Val Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly Leu Met Ala 
820 825 830 

Leu Thr Leu Ser Pro Tyr Tyr Lys Arg Tyr He Ser Trp Cys Met Trp 
835 840 845 

Trp Leu Gin Tyr Phe Leu Thr Arg Val Glu Ala Gin Leu His Val Trp 
850 855 860 
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Val Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Val lie Leu Leu 
865 870 875 880 

Met Cys Val Val His Pro Thr Leu Val Phe Asp lie Thr Lys Leu Leu 
885 890 895 

Leu Ala lie Phe Gly Pro Leu Trp lie Leu Gin Ala Ser Leu Leu Lys 
900 905 910 

Val Pro Tyr Phe Val Arg Val Gin Gly Leu Leu Arg He Cys Ala Leu 
915 920 925 

Ala Arg Lys He Ala Gly Gly His Tyr Val Gin Met Ala lie lie Lys 
930 935 940 

Leu Gly Ala Leu Thr Gly Thr Tyr Val Tyr Asn His Leu Thr Pro Leu 
945 950 955 960 

Arg Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 

Pro Val Val Phe Ser Arg Met Glu Thr Lys Leu He Thr Trp Gly Ala 
980 985 990 

Asp Thr Ala Ala Cys Gly Asp He He Asn Gly Leu Pro Val Ser Ala 
995 1000 1005 

Arg Arg Gly Gin Glu He Leu Leu Gly Pro Ala Asp Gly Met Val Ser 
1010 1015 1020 

Lys Gly Trp Arg Leu Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Cys He He Thr Ser Leu Thr Gly Arg Asp Lys 
1045 1050 1055 

Asn Gin Val Glu Gly Glu Val Gin lie Val Ser Thr Ala Thr Gin Thr 
1060 1065 1070 

Phe Leu Ala Thr Cys He Asn Gly Val Cys Trp Thr Val Tyr His Gly 
1075 1080 1085 

Ala Gly Thr Arg Thr He Ala Ser Pro Lys Gly Pro Val He Gin Met 
1090 1095 1100 

Tyr Thr Asn Val Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly 
1105 1110 1115 1120 
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Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg His Ala Asp Val He Pro Val Arg Arg Arg Gly Asp Ser 
1140 1145 1150 

Arg Gly Ser Leu Leu Ser Pro Arg Pro He Ser Tyr Leu Lys Gly Ser 
1155 1160 1165 

Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe He 
1185 1190 1195 1200 

Pro Val Glu Asn Leu Gly Thr Thr Met Arg Ser Pro Val Phe Thr Asp 
1205 1210 1215 

Asn Ser Ser Pro Pro Ala Val Pro Gin Ser Phe Gin Val Ala His Leu 
1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr 
1235 1240 1245 

Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp Pro 
1265 1270 1275 1280 

Asn He Arg Thr Gly Val Arg Thr He Thr Thr Gly Ser Pro He Thr 
1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly 
1300 1305 1310 

Ala Tyr Asp He He He Cys Asp Glu Cys His Ser Thr Asp Ala Thr 
1315 1320 1325 

Ser He Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 
1330 1335 1340 

Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Val Ser His Pro Asn He Glu Glu Val Ala Leu Ser Thr Thr Gly Glu 
1365 1370 1375 
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lie Pro Phe Tyr Gly Lys Ala lie Pro Leu Glu Val lie Lys Gly Gly 
1380 1385 1390 

Arg His Leu lie Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 
1395 1400 1405 

Ala Lys Leu Val Ala Leu Gly lie Asn Ala Val Ala Tyr Tyr Arg Gly 
1410 1415 1420 

Leu Asp Val Ser Val lie Pro Thr Ser Gly Asp Val Val Val Val Ser 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val lie 
1445 1450 1455 

Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro 
1460 1465 1470 

Thr Phe Thr lie Glu Thr Thr Thr Leu Pro Gin Asp Ala Val Ser Arg 
1475 1480 1485 

Thr Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro Gly lie Tyr Arg 
1490 1495 1500 

Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro 
1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn Thr Pro Gly Leu 
1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr Gly 
1555 1560 1565 

Leu Thr His lie Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly 
1570 1575 1580 

Glu Asn Phe Pro Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Gin Ala Pro Pro Pro Ser Trp Asp Gin Met Trp Lys Cys Leu lie 
1605 1610 1615 

Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
1620 1625 1630 
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Gly Ala Val Gin Asn Glu Val Thr Leu Thr His Pro lie Thr Lys Tyr 
1635 1640 1645 

lie Met Thr Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp 
1650 1655 1660 

Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser 
1665 1670 1675 1680 

Thr Gly Cys Val Val He Val Gly Arg He Val Leu Ser Gly Lys Pro 
1685 1690 1695 

Ala He He Pro Asp Arg Glu Val Leu Tyr Gin Glu Phe Asp Glu Met 
1700 1705 1710 

Glu Glu Cys Ser Gin His Leu Pro Tyr He Glu Gin Gly Met Met Leu 
1715 1720 1725 

Ala Glu Gin Phe Lys Gin Lys Ala Leu Gly Leu Leu Gin Thr Ala Ser 
1730 1735 1740 

Arg His Ala Glu Val He Thr Pro Ala Val Gin Thr Asn Trp Gin Lys 
1745 1750 1755 1760 

Leu Glu Val Phe Trp Ala Lys His Met Trp Asn Phe He Ser Gly He 
1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala He Ala 
1780 1785 1790 

Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Gly 
1795 1800 1805 

Gin Thr Leu Leu Phe Asn He Leu Gly Gly Trp Val Ala Ala Gin Leu 
1810 1815 1820 

Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala Gly Leu Ala Gly 
1825 1830 1835 1840 

Ala Ala He Gly Ser Val Gly Leu Gly Lys Val Leu Val Asp He Leu 
1845 1850 1855 

Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys He 
I860 1865 1870 

Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro 
1875 1880 1885 
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Ala lie Leu Ser Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala 
1890 1895 1900 

lie Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 1910 1915 1920 

Asn Arg Leu lie Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr 
1925 1930 1935 

His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val Thr Ala He Leu 
1940 1945 1950 

Ser Ser Leu Thr Val Thr Gin Leu Leu Arg Arg Leu His Gin Trp He 
1955 1960 1965 

Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp He 
1970 1975 1980 

Trp Asp Trp He Cys Glu Val Leu Ser Asp Phe Lys Thr Trp Leu Lys 
1985 1990 1995 2000 

Ala Lys Leu Met Pro Gin Leu Pro Gly He Pro Phe Val Ser Cys Gin 
2005 2010 2015 

Arg Gly Tyr Arg Gly Val Trp Arg Gly Asp Gly He Met His Thr Arg 
2020 2025 2030 

Cys His Cys Gly Ala Glu He Thr Gly His Val Lys Asn Gly Thr Met 
2035 2040 2045 

Arg He Val Gly Pro Arg Thr Cys Arg Asn Met Trp Ser Gly Thr Phe 
2050 2055 2060 

Pro He Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Leu Pro Ala Pro 
2065 2070 2075 2080 

Asn Tyr Lys Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr Val Glu 
2085 2090 2095 

He Arg Arg Val Gly Asp Phe His Tyr Val Ser Gly Met Thr Thr Asp 
2100 2105 2110 

Asn Leu Lys Cys Pro Cys Gin He Pro Ser Pro Glu Phe Phe Thr Glu 
2115 2120 2125 

Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro Cys Lys Pro Leu 
2130 2135 2140 
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Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val 
2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr 
2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His He Thr Ala Glu Ala Ala Gly Arg 
2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Met Ala Ser Ser Ser Ala Ser 
2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr Ala Asn His Asp 
2210 2215 2220 

Ser Pro Asp Ala Glu Leu He Glu Ala Asn Leu Leu Trp Arg Gin Glu 
2225 2230 2235 2240 

Met Gly Gly Asn He Thr Arg Val Glu Ser Glu Asn Lys Val Val He 
2245 2250 2255 

Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp Glu Arg Glu Val 
2260 2265 2270 

Ser Val Pro Ala Glu He Leu Arg Lys Ser Arg Arg Phe Ala Arg Ala 
2275 2280 2285 

Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Thr 
2290 2295 2300 

Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His Gly Cys Pro Leu 
2305 2310 2315 2320 

Pro Pro Pro Arg Ser Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr 
2325 2330 2335 

Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu Ala Glu Leu Ala 
2340 2345 2350 

Thr Lys Ser Phe Gly Ser Ser Ser Thr Ser Gly He Thr Gly Asp Asn 
2355 2360 2365 

Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys Pro Pro Asp Ser 
2370 2375 2380 

Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395 2400 
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Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser Ser Gly Ala 
2405 2410 2415 

Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly 
2420 2425 2430 

Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gin Lys Leu Pro lie Asn 
2435 2440 2445 

Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr 
2450 2455 2460 

Thr Ser Arg Ser Ala Cys Gin Arg Gin Lys Lys Val Thr Phe Asp Arg 
2465 2470 2475 2480 

Leu Gin Val Leu Asp Ser His Tyr Gin Asp Val Leu Lys Glu Val Lys 
2485 2490 2495 

Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala 
2500 2505 2510 

Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly 
2515 2520 2525 

Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val Ala His lie Asn 
2530 2535 2540 



Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Val Thr Pro lie Asp Thr 
2545 2550 2555 2560 

Thr lie Met Ala Lys Asn Glu Val Phe Cys Val Gin Pro Glu Lys Gly 
2565 2570 2575 

Gly Arg Lys Pro Ala Arg Leu lie Val Phe Pro Asp Leu Gly Val Arg 
2580 2585 2590 

Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Ser Lys Leu Pro Leu 
2595 2600 2605 

Ala Val Met Gly Ser Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg 
2610 2615 2620 



Val Glu Phe Leu Val Gin Ala Trp Lys Ser Lys Lys Thr Pro Met Gly 

2625 2630 2635 2640 

Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp 

2645 2650 2655 
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He Arg Thr Glu Glu Ala He Tyr Gin Cys Cys Asp Leu Asp Pro Gin 
2660 2665 2670 

Ala Arg Val Ala lie Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly 
2675 2680 2685 

Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg 
2690 2695 2700 

Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr 
2705 2710 2715 2720 

He Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gin Asp Cys Thr 
2725 2730 2735 

Met Leu Val Cys Gly Asp Asp Leu Val Val He Cys Glu Ser Ala Gly 
2740 2745 2750 

Val Gin Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr 
2755 2760 2765 

Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr Asp Leu 
2770 2775 2780 

Glu Leu He Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly 
2785 2790 2795 2800 

Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu 
2805 2810 2815 

Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp 
2820 2825 2830 

Leu Gly Asn He He Met Phe Ala Pro Thr Leu Trp Ala Arg Met He 
2835 2840 2845 

Leu Met Thr His Phe Phe Ser Val Leu He Ala Arg Asp Gin Leu Glu 
2850 2855 2860 

Gin Ala Leu Asn Cys Glu He Tyr Gly Ala Cys Tyr Ser He Glu Pro 
2865 2870 2875 2880 

Leu Asp Leu Pro Pro He He Gin Arg Leu His Gly Leu Ser Ala Phe 
2885 2890 2895 

Ser Leu His Ser Tyr Ser Pro Gly Glu He Asn Arg Val Ala Ala Cys 
2900 2905 2910 
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Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala 
2915 2920 2925 

Arg Ser Val Arg Ala Arg Leu Leu Ser Arg Gly Gly Arg Ala Ala lie 
2930 2935 2940 

Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr Lys Leu Lys Leu 
2945 2950 2955 2960 

Thr Pro lie Ala Ala Ala Gly Arg Leu Asp Leu Ser Gly Trp Phe Thr 
2965 2970 2975 

Ala Gly Tyr Ser Gly Gly Asp lie Tyr His Ser Val Ser His Ala Arg 
2980 2985 2990 

Pro Arg Trp Phe Trp Phe Cys Leu Leu Leu Leu Ala Ala Gly Val Gly 
2995 3000 3005 

lie Tyr Leu Leu Pro Asn Arg 
3010 3015 



<210> 9 
<211> 9611 
<212> DNA 

<213> Hepatitis C virus 
<400> 9 

gccagccccc tgatgggggc gacactccac 
tcttcacgca gaaagcgtct agccatggcg 
cccccctccc gggagagcca tagtggtctg 
gacgaccggg tcctttcttg gataaacccg 
gcaagactgc tagccgagta gtgttgggtc 
gtgcttgcga gtgccccggg aggtctcgta 
ctcaaagaaa aaccaaaaga aacaccaacc 
gcggccagat cgttggcgga gtatacttgt 
gcgcgacaag gaagacttcg gagcggtccc 
aagatcggcg ctccactggc aaatcctggg 
ggaatgaggg actcggctgg gcaggatggc 
ggggccccaa tgacccccgg cataggtcgc 
cgtgcggctt tgccgacctc atggggtaca 
tcgccagagc tctcgcgcat ggcgtgagag 
ggaacttacc cggttgctcc ttttctatct 
ccccggtctc cgctgccgaa gtgaagaaca 
gcaccaatga cagcattacc tggcagctcc 
tcccgtgcga gaaagtgggg aatgcatctc 
ccgtgcagcg gcccggcgcc ctcacgcagg 
tgtccgccac gctctgctct gccctctacg 



catgaatcac tcccctgtga ggaactactg 60 
ttagtatgag tgtcgtgcag cctccaggac 120 
cggaaccggt gagtacaccg gaattgccag 180 
ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgaaaggcc ttgtggtact gcctgatagg 3 00 
gaccgtgcac catgagcaca aatcctaaac 3 60 
gtcgcccaca agacgttaag tttccgggcg 420 
tgccgcgcag gggccccagg ttgggtgtgc 4 80 
agccacgtgg aaggcgccag cccatcccta 540 
gaaaaccagg atacccctgg cccctatacg 600 
tcctgtcccc ccgaggttcc cgtccctctt 660 
gcaacgtggg taaggtcatc gataccctaa 720 
tccctgtcgt gggcgccccg ctcggcggcg 780 
tcctggagga cggggttaat tttgcaacag 840 
tcttgctggc cctgctgtcc tgcatcacca 900 
tcagtaccgg ctacatggtg actaacgact 960 
aggctgctgt cctccacgtc cccgggtgcg 1020 
agtgctggat accggtctca ccgaatgtgg 1080 
gcttgcggac gcacatcgac atggttgtga 114 0 
tgggggacct ctgcggtggg gtgatgctcg 1200 
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cagcccaaat gttcattgtc tcgccgcagc 
ccatctaccc tggtaccatc actggacacc 
cgcccacggc taccatgatc ttggcgtacg 
tcattagcgg ggctcattgg ggcgtcatgt 
cgtgggcgaa agtcgttgtc atccttctgt 
ctgttggggg ttctgccgcg cagaccaccg 
ccaggcagaa aatccagctc gttaacacca 
tgaactgcaa tgactccttg cacaccggct 
tcaactcgtc aggatgtccc gaacgcatgt 
tgggatgggg cgccttgcaa tatgaggata 
attgctggca ctacccacca aggcagtgtg 
cagtgtactg tttcaccccc agcccagtgg 
ccacttacac gtggggggag aatgagacag 
cgctggggtc atggttcggc tgcacgtgga 
gcgcaccacc ctgccgtact agagctgact 
cggactgttt taggaagcat cctgatacca 
tcacgccaag gtgcctgatc gactacccct 
actataccat cttcaaaata aggatgtatg 
catgcaattt cactcgtggg gatcgttgca 
ctcctttgtt gcactccacc acggaatggg 
ccgccttgtc gactggtctt ctccacctcc 
atggcctatc acctgccctc acaaaataca 
tcctgctctt agcggacgcc agggtttgcg 
aggccgaagc agctttggag aacctcgtaa 
acggtcttgt gtccttcctc gtgttcttct 
tgcccggagc ggtctacgcc ctctacggga 
tgcctcagcg ggcatatgca ctggacacgg 
ttgtcgggtt aatggcgctg actctgtcgc 
tgtggtggct tcagtatttt ctgaccagag 
ccctcaacgt ccgggggggg cgcgatgccg 
ccctggtatt tgacatcacc aaactactcc 
aagccagttt gcttaaagtc ccctacttcg 
cgctagcgcg gaagatagcc ggaggtcatt 
cgcttactgg cacctatgtg tataaccatc 
gcctgcgaga tctggccgtg gctgtggaac 
tcatcacgtg gggggcagat accgccgcgt 
ctgcccgtag gggccaggag atactgcttg 
99&9gttgct ggcgcccatc acggcgtacg 
taatcaccag cctgactggc cgggacaaaa 
caactgctac ccaaaccttc ctggcaacgt 
acggggccgg aacgaggacc atcgcatcac 
atgtggacca agaccttgtg ggctggcccg 
gtacctgcgg ctcctcggac ctttacctgg 
gccggcgagg tgatagcagg ggtagcctgc 
gctcctcggg gggtccgctg ttgtgccccg 
cggtgtgcac ccgtggagtg gctaaagcgg 
caaccatgag atccccggtg ttcacggaca 
tccaggtggc ccacctgcat gctcccaccg 



accactggtt tgtccaagac tgcaattgct 1260 
gcatggcatg ggacatgatg atgaactggt 1320 
cgatgcgtgt ccccgaggtc attatagaca 1380 
tcggcttggc ctacttctct atgcagggag 1440 
tggccgccgg ggtggacgcg cgcacccata 1500 
ggcgcctcac cagcttattt gacatgggcc 1560 
atggcagctg gcacatcaac cgcaccgccc 1620 
ttatcgcgtc tctgttctac acccacagct 1680 
ccgcctgccg cagtatcgag gccttccggg 1740 
atgtcaccaa tccagaggat atgagaccct 1800 
gcgtggtctc cgcgaagact gtgtgtggcc 1860 
tagtgggcac gaccgacagg cttggagcgc 1920 
atgtcttcct attgaacagc actcgaccac 1980 
tgaactcttc tggctacacc aagacttgcg 2 04 0 
tcaacgccag cacggacctg ttgtgcccca 2100 
cttacctcaa atgcggctct gggccctggc 2160 
acaggctctg gcattacccc tgcacagtta 2220 
tgggaggggt tgagcacagg ctcacggctg 22 80 
acttggagga cagagacaga agtcaactgt 234 0 
ccattttacc ttgctcttac tcggacctgc 24 00 
accaaaacat cgtggacgta caattcatgt 2460 
tcgtccgatg ggagtgggta atactcttat 252 0 
cctgcttatg gatgctcatc ttgttgggcc 2580 
tactcaatgc agcatccctg gccgggacgc 264 0 
gctttgcgtg gtatctgaag ggtaggtggg 2 7 00 
tgtggcctct cctcctgctc ctgctggcgt 2760 
aggtggccgc gtcgtgtggc ggcgttgttc 2820 
catattacaa gcgctatatc agctggtgca 2880 
tagaagcgca actgcacgtg tgggttcccc 2 94 0 
tcatcttact catgtgtgta gtacacccga 3000 
tggccatctt cggacccctt tggattcttc 3060 
tgcgcgttca aggccttctc cggatctgcg 3120 
acgtgcaaat ggccatcatc aagttagggg 3180 
tcacccctct tcgagactgg gcgcacaacg 3240 
cagtcgtctt ctcccgaatg gagaccaagc 3300 
gcggtgacat catcaacggc ttgcccgtct 3 360 
ggccagccga cggaatggtc tccaaggggt 3420 
cccagcagac gagaggcctc ctagggtgta 34 80 
accaagtgga gggtgaggtc cagatcgtgt 3540 
gcatcaatgg ggtatgctgg actgtctacc 3600 
ccaagggtcc tgtcatccag atgtatacca 3660 
ctcctcaagg ttcccgctca ttgacaccct 3720 
tcacgaggca cgccgatgtc attcccgtgc 3780 
tttcgccccg gcccatttcc tacttgaaag 3840 
cgggacacgc cgtgggccta ttcagggccg 3900 
tggactttat ccctgtggag aacctaggga 3 960 
actcctctcc accagcagtg ccccagagct 4020 
gcagcggtaa gagcaccaag gtcccggctg 4080 
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cgtacgcagc ccagggctac aaggtgttgg 
gctttggtgc ttacatgtcc aaggcccatg 
gaacaattac cactggcagc cccatcacgt 
gcgggtgctc aggaggtgct tatgacataa 
ccacatccat cttgggcatc ggcactgtcc 
tggttgtgct cgccactgct acccctccgg 
aggaggttgc tctgtccacc accggagaga 
aggtgatcaa ggggggaaga catctcatct 
tcgccgcgaa gctggtcgca ttgggcatca 
tgtctgtcat cccgaccagc ggcgatgttg 
gctttaccgg cgacttcgac tctgtgatag 
atttcagcct tgaccctacc tttaccattg 
ccaggactca acgccggggc aggactggca 
caccggggga gcgcccctcc ggcatgttcg 
cgggctgtgc ttggtatgag ctcacgcccg 
tgaacacccc ggggcttccc gtgtgccagg 
cgggcctcac tcatatagat gcccactttt 
ttccttacct ggtagcgtac caagccaccg 
cgtgggacca gatgtggaag tgtttgatcc 
ccctgctata cagactgggc gctgttcaga 
aatacatcat gacatgcatg tcggccgacc 
ttggcggcgt cctggctgct ctggccgcgt 
tgggcaggat cgtcttgtcc gggaagccgg 
aggagttcga tgagatggaa gagtgctctc 
tgctcgctga gcagttcaag cagaaggccc 
cagaggttat cacccctgct gtccagacca 
agcacatgtg gaatttcatc agtgggatac 
gtaaccccgc cattgcttca ttgatggctt 
ctggccaaac cctcctcttc aacatattgg 
ccggtgccgc tactgccttt gtgggtgctg 
gactggggaa ggtcctcgtg gacattcttg 
ttgtagcatt caagatcatg agcggtgagg 
tgcccgccat cctctcgcct ggagcccttg 
gccggcacgt tggcccgggc gagggggcag 
cctcccgggg gaaccatgtt tcccccacgc 
gcgtcactgc catactcagc agcctcactg 
ggataagctc ggagtgtacc actccatgct 
ggatatgcga ggtgctgagc gactttaaga 
tgcctgggat tccctttgtg tcctgccagc 
gcattatgca cactcgctgc cactgtggag 
cgatgaggat cgtcggtcct aggacctgca 
acgcctacac cacgggcccc tgtactcccc 
ggagggtgtc tgcagaggaa tacgtggaga 
cgggtatgac tactgacaat cttaaatgcc 
cagaattgga cggggtgcgc ctacacaggt 
aggaggtatc attcagagta ggactccacg 
agcccgaacc ggacgtagcc gtgttgacgt 
cagaggcggc cgggagaagg ttggcgagag 



tgctcaaccc ctctgttgct gcaacgctgg 4140 
gggttgatcc taatatcagg accggggtga 4200 
actccaccta cggcaagttc cttgccgacg 4260 
taatttgtga cgagtgccac tccacggatg 4320 
ttgaccaagc agagactgcg ggggcgagac 4380 
gctccgtcac tgtgtcccat cctaacatcg 4440 
tcccctttta cggcaaggct atccccctcg 4500 
tctgccactc aaagaagaag tgcgacgagc 4560 
atgccgtggc ctactaccgc ggtcttgacg 4620 
tcgtcgtgtc gaccgatgct ctcatgactg 4680 
actgcaacac gtgtgtcact cagacagtcg 4740 
agacaaccac gctcccccag gatgctgtct 4800 
gggggaagcc aggcatctat agatttgtgg 4860 
actcgtccgt cctctgtgag tgctatgacg 4 92 0 
ccgagactac agttaggcta cgagcgtaca 4 980 
accatcttga attttgggag ggcgtcttta 5040 
tatcccagac aaagcagagt ggggagaact 5100 
tgtgcgctag ggctcaagcc cctcccccat 5160 
gccttaaacc caccctccat gggccaacac 5220 
atgaagtcac cctgacgcac ccaatcacca 5280 
tggaggtcgt cacgagcacc tgggtgctcg 5340 
attgcctgtc aacaggctgc gtggtcatag 5400 
caattatacc tgacagggag gttctctacc 5460 
agcacttacc gtacatcgag caagggatga 5520 
tcggcctcct gcagaccgcg tcccgccatg 5580 
actggcagaa actcgaggtc ttttgggcga 5640 
aatacttggc gggcctgtca acgctgcctg 5700 
ttacagctgc cgtcaccagc ccactaacca 5760 
99gggtgggt ggctgcccag ctcgccgccc 5820 
gcctagctgg cgccgccatc ggcagcgttg 5880 
cagggtatgg cgcgggcgtg gcgggagctc 594 0 
tcccctccac ggaggacctg gtcaatctgc 6000 
tagtcggtgt ggtctgcgca gcaatactgc 6060 
tgcaatggat gaaccggcta atagccttcg 6120 
actacgtgcc ggagagcgat gcagccgccc 6180 
taacccagct cctgaggcga ctgcatcagt 6240 
ccggttcctg gctaagggac atctgggact 63 00 
cctggctgaa agccaagctc atgccacaac 6360 
gcgggtatag gggggtctgg cgaggagacg 6420 
ctgagatcac tggacatgtc aaaaacggga 64 80 
ggaacatgtg gagtgggacg ttccccatta 654 0 
ttcctgcgcc gaactataag ttcgcgctgt 6600 
taaggcgggt gggggacttc cactacgtat 6660 
cgtgccagat cccatcgccc gaatttttca 6720 
ttgcgccccc ttgcaagccc ttgctgcggg 6780 
agtacccggt ggggtcgcaa ttaccttgcg 6840 
ccatgctcac tgatccctcc catataacag 6900 
ggtcaccccc ttctatggcc agctcctcgg 6960 
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ctagccagct gtccgctcca tctctcaagg caacttgcac cgccaaccat gactcccctg 7020 
acgccgagct catagaggct aacctcctgt ggaggcagga gatgggcggc aacatcacca 7080 
gggttgagtc agagaacaaa gtggtgattc tggactcctt cgatccgctt gtggcagagg 7140 
aggatgagcg ggaggtctcc gtacctgcag aaattctgcg gaagtctcgg agattcgccc 72 00 
gggccctgcc cgtctgggcg cggccggact acaacccccc gctagtagag acgtggaaaa 7260 
agcctgacta cgaaccacct gtggtccatg gctgcccgct accacctcca cggtcccctc 7320 
ctgtgcctcc gcctcggaaa aagcgtacgg tggtcctcac cgaatcaacc ctatctactg 73 80 
ccttggccga gcttgccacc aaaagttttg gcagctcctc aacttccggc attacgggcg 744 0 
acaatacgac aacatcctct gagcccgccc cttctggctg cccccccgac tccgacgttg 7500 
agtcctattc ttccatgccc cccctggagg gggagcctgg ggatccggat ctcagcgacg 7560 
ggtcatggtc gacggtcagt agtggggccg acacggaaga tgtcgtgtgc tgctcaatgt 7620 
cttattcctg gacaggcgca ctcgtcaccc cgtgcgctgc ggaagaacaa aaactgccca 7680 
tcaacgcact gagcaactcg ttgctacgcc atcacaatct ggtgtattcc accacttcac 774 0 
gcagtgcttg ccaaaggcag aagaaagtca catttgacag actgcaagtt ctggacagcc 78 00 
attaccagga cgtgctcaag gaggtcaaag cagcggcgtc aaaagtgaag gctaacttgc 7860 
tatccgtaga ggaagcttgc agcctgacgc ccccacattc agccaaatcc aagtttggct 7920 
atggggcaaa agacgtccgt tgccatgcca gaaaggccgt agcccacatc aactccgtgt 7980 
ggaaagacct tctggaagac agtgtaacac caatagacac taccatcatg gccaagaacg 804 0 
aggttttctg cgttcagcct gagaaggggg gtcgtaagcc agctcgtctc atcgtgttcc 8100 
ccgacctggg cgtgcgcgtg tgcgagaaga tggccctgta cgacgtggtt agcaagctcc 8160 
ccctggccgt gatgggaagc tcctacggat tccaatactc accaggacag cgggttgaat 8220 
tcctcgtgca agcgtggaag tccaagaaga ccccgatggg gttctcgtat gatacccgct 82 80 
gttttgactc cacagtcact gagagcgaca tccgtacgga ggaggcaatt taccaatgtt 834 0 
gtgacctgga cccccaagcc cgcgtggcca tcaagtccct cactgagagg ctttatgttg 84 00 
ggggccctct taccaattca aggggggaaa actgcggcta ccgcaggtgc cgcgcgagcg 8460 
gcgtactgac aactagctgt ggtaacaccc tcacttgcta catcaaggcc cgggcagcct 8520 
gtcgagccgc agggctccag gactgcacca tgctcgtgtg tggcgacgac ttagtcgtta 8580 
tctgtgaaag tgcgggggtc caggaggacg cggcgagcct gagagccttc acggaggcta 864 0 
tgaccaggta ctccgccccc cccggggacc ccccacaacc agaatacgac ttggagctta 8700 
taacatcatg ctcctccaac gtgtcagtcg cccacgacgg cgctggaaag agggtctact 8760 
accttacccg tgaccctaca acccccctcg cgagagccgc gtgggagaca gcaagacaca 8820 
ctccagtcaa ttcctggcta ggcaacataa tcatgtttgc ccccacactg tgggcgagga 8880 
tgatactgat gacccatttc tttagcgtcc tcatagccag ggatcagctt gaacaggctc 8940 
ttaactgtga gatctacgga gcctgctact ccatagaacc actggatcta cctccaatca 9000 
ttcaaagact ccatggcctc agcgcatttt cactccacag ttactctcca ggtgaaatca 9060 
atagggtggc cgcatgcctc agaaaacttg gggtcccgcc cttgcgagct tggagacacc 9120 
gggcccggag cgtccgcgct aggcttctgt ccagaggagg cagggctgct atatgtggca 9180 
agtacctctt caactgggca gtaagaacaa agctcaaact cactccaata gcggccgctg 9240 
gccggctgga cttgtccggt tggttcacgg ctggctacag cgggggagac atttatcaca 93 00 
gcgtgtctca tgcccggccc cgctggttct ggttttgcct actcctgctc gctgcagggg 9360 
taggcatcta cctcctcccc aaccgatgaa ggttggggta aacactccgg cctcttaagc 9420 
catttcctgt tttttttttt tttttttttt tttttttctt tttttttttc tttcctttcc 9480 
ttcttttttt cctttctttt tcccttcttt aatggtggct ccatcttagc cctagtcacg 9540 
gctagctgtg aaaggtccgt gagccgcatg actgcagaga gtgctgatac tggcctctct 9600 
gcagatcatg t 9611 



<210> 10 
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<211> 3015 
<212> PRT 

<213> Hepatitis C virus 
<400> 10 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

lie Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val lie Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Val Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys lie Thr Thr Pro Val Ser Ala Ala 
180 185 190 

Glu Val Lys Asn lie Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr 
195 200 205 

Asn Asp Ser lie Thr Trp Gin Leu Gin Ala Ala Val Leu His Val Pro 
210 215 220 

Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gin Cys Trp lie 
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225 230 235 240 

Pro Val Ser Pro Asn Val Ala Val Gin Arg Pro Gly Ala Leu Thr Gin 
245 250 255 

Gly Leu Arg Thr His He Asp Met Val Val Met Ser Ala Thr Leu Cys 
260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 
275 280 285 

Gin Met Phe He Val Ser Pro Gin His His Trp Phe Val Gin Asp Cys 
290 295 300 

Asn Cys Ser He Tyr Pro Gly Thr He Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met lie Leu Ala Tyr 
325 330 335 

Ala Met Arg Val Pro Glu Val He He Asp He He Ser Gly Ala His 
340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp 
355 360 365 

Ala Lys Val Val Val He Leu Leu Leu Ala Ala Gly Val Asp Ala Arg 
370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gin Thr Thr Gly Arg Leu Thr 
385 390 395 400 

Ser Leu Phe Asp Met Gly Pro Arg Gin Lys He Gin Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu His Thr Gly Phe He Ala Ser Leu Phe Tyr Thr His Ser Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser He Glu Ala 
450 455 460 

Phe Arg Val Gly Trp Gly Ala Leu Gin Tyr Glu Asp Asn Val Thr Asn 
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gin Cys 
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485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asri Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 
595 600 605 

Pro Arg Cys Leu lie Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
610 615 620 

Thr Val Asn Tyr Thr lie Phe Lys lie Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 
645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gin Leu Ser Pro Leu Leu His Ser 
660 665 670 

Thr Thr Glu Trp Ala lie Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala 
675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gin Asn lie Val Asp Val Gin 
690 695 700 

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr lie Val Arg Trp 
705 710 715 720 

Glu Trp Val lie Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
725 730 735 

Ala Cys Leu Trp Met Leu lie Leu Leu Gly Gin Ala Glu Ala Ala Leu 
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740 745 750 

Glu Asn Leu Val lie Leu Asn Ala Ala Ser Leu Ala Gly Thr His Gly 
755 760 765 

Leu Val Ser Phe Leu Val Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly 
770 775 780 

Arg Trp Val Pro Gly Ala Val Tyr Ala Leu Tyr Gly Met Trp Pro Leu 
785 790 795 800 

Leu Leu Leu Leu Leu Ala Leu Pro Gin Arg Ala Tyr Ala Leu Asp Thr 
805 810 815 

Glu Val Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly Leu Met Ala 
820 825 830 

Leu Thr Leu Ser Pro Tyr Tyr Lys Arg Tyr lie Ser Trp Cys Met Trp 
835 840 845 

Trp Leu Gin Tyr Phe Leu Thr Arg Val Glu Ala Gin Leu His Val Trp 
850 855 860 

Val Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Val lie Leu Leu 
865 870 875 880 

Met Cys Val Val His Pro Thr Leu Val Phe Asp lie Thr Lys Leu Leu 
885 890 895 

Leu Ala lie Phe Gly Pro Leu Trp lie Leu Gin Ala Ser Leu Leu Lys 
900 905 910 

Val Pro Tyr Phe Val Arg Val Gin Gly Leu Leu Arg lie Cys Ala Leu 
915 920 925 

Ala Arg Lys lie Ala Gly Gly His Tyr Val Gin Met Ala lie He Lys 
930 935 940 

Leu Gly Ala Leu Thr Gly Thr Tyr Val Tyr Asn His Leu Thr Pro Leu 
945 950 955 960 

Arg Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 

Pro Val Val Phe Ser Arg Met Glu Thr Lys Leu He Thr Trp Gly Ala 
980 985 990 

Asp Thr Ala Ala Cys Gly Asp He He Asn Gly Leu Pro Val Ser Ala 
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995 1000 1005 

Arg Arg Gly Gin Glu lie Leu Leu Gly Pro Ala Asp Gly Met Val Ser 
1010 1015 1020 

Lys Gly Trp Arg Leu Leu Ala Pro lie Thr Ala Tyr Ala Gin Gin Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Cys lie lie Thr Ser Leu Thr Gly Arg Asp Lys 
1045 1050 1055 

Asn Gin Val Glu Gly Glu Val Gin lie Val Ser Thr Ala Thr Gin Thr 
1060 1065 1070 

Phe Leu Ala Thr Cys lie Asn Gly Val Cys Trp Thr Val Tyr His Gly 
1075 1080 1085 

Ala Gly Thr Arg Thr lie Ala Ser Pro Lys Gly Pro Val lie Gin Met 
1090 1095 1100 

Tyr Thr Asn Val Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly 
1105 1110 1115 1120 

Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg His Ala Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser 
1140 1145 1150 

Arg Gly Ser Leu Leu Ser Pro Arg Pro lie Ser Tyr Leu Lys Gly Ser 
1155 1160 1165 

Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe lie 
1185 1190 ^ 1195 1200 

Pro Val Glu Asn Leu Gly Thr Thr Met Arg Ser Pro Val Phe Thr Asp 
1205 1210 1215 

Asn Ser Ser Pro Pro Ala Val Pro Gin Ser Phe Gin Val Ala His Leu 
1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr 
1235 1240 1245 

Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 
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1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp Pro 

1265 1270 1275 1280 

Asn lie Arg Thr Gly Val Arg Thr lie Thr Thr Gly Ser Pro lie Thr 

1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly 

1300 1305 1310 

Ala Tyr Asp lie lie lie Cys Asp Glu Cys His Ser Thr Asp Ala Thr 
1315 1320 1325 

Ser lie Leu Gly lie Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 
1330 1335 1340 

Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 

1345 1350 1355 1360 

Val Ser His Pro Asn lie Glu Glu Val Ala Leu Ser Thr Thr Gly Glu 

1365 1370 1375 

lie Pro Phe Tyr Gly Lys Ala lie Pro Leu Glu Val lie Lys Gly Gly 

1380 1385 1390 

Arg His Leu lie Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 
1395 1400 1405 

Ala Lys Leu Val Ala Leu Gly lie Asn Ala Val Ala Tyr Tyr Arg Gly 
1410 1415 1420 

Leu Asp Val Ser Val He Pro Thr Ser Gly Asp Val Val Val Val Ser 

1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val lie 

1445 1450 1455 

Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro 

1460 1465 1470 

Thr Phe Thr He Glu Thr Thr Thr Leu Pro Gin Asp Ala Val Ser Arg 
1475 1480 1485 

Thr Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro Gly He Tyr Arg 
1490 1495 1500 

Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val 
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1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro 

1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn Thr Pro Gly Leu 

1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr Gly 

1555 1560 1565 

Leu Thr His lie Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly 

1570 1575 1580 

Glu Asn Phe Pro Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg 

1585 1590 1595 1600 

Ala Gin Ala Pro Pro Pro Ser Trp Asp Gin Met Trp Lys Cys Leu lie 

1605 1610 1615 

Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu 

1620 1625 1630 

Gly Ala Val Gin Asn Glu Val Thr Leu Thr His Pro lie Thr Lys Tyr 

1635 1640 1645 

lie Met Thr Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp 

1650 1655 1660 

Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser 

1665 1670 1675 1680 

Thr Gly Cys Val Val lie Val Gly Arg He Val Leu Ser Gly Lys Pro 

1685 1690 1695 

Ala He He Pro Asp Arg Glu Val Leu Tyr Gin Glu Phe Asp Glu Met 

1700 1705 1710 

Glu Glu Cys Ser Gin His Leu Pro Tyr He Glu Gin Gly Met Met Leu 

1715 1720 1725 

Ala Glu Gin Phe Lys Gin Lys Ala Leu Gly Leu Leu Gin Thr Ala Ser 

1730 1735 1740 

Arg His Ala Glu Val He Thr Pro Ala Val Gin Thr Asn Trp Gin Lys 

1745 1750 1755 1760 

Leu Glu Val Phe Trp Ala Lys His Met Trp Asn Phe He Ser Gly He 
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1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala lie Ala 
1780 1785 1790 

Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Gly 
1795 1800 1805 

Gin Thr Leu Leu Phe Asn lie Leu Gly Gly Trp Val Ala Ala Gin Leu 
1810 1815 1820 

Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala Gly Leu Ala Gly 
1825 1830 1835 1840 

Ala Ala He Gly Ser Val Gly Leu Gly Lys Val Leu Val Asp He Leu 
1845 1850 1855 

Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys lie 
1860 1865 1870 

Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro 
1875 1880 1885 

Ala He Leu Ser Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala 
1890 1895 1900 

He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 1910 1915 1920 

Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr 
1925 1930 1935 

His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val Thr Ala He Leu 
1940 1945 1950 

Ser Ser Leu Thr Val Thr Gin Leu Leu Arg Arg Leu His Gin Trp He 
1955 1960 1965 

Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp He 
1970 1975 1980 

Trp Asp Trp He Cys Glu Val Leu Ser Asp Phe Lys Thr Trp Leu Lys 
1985 1990 1995 2000 

Ala Lys Leu Met Pro Gin Leu Pro Gly He Pro Phe Val Ser Cys Gin 
2005 2010 2015 

Arg Gly Tyr Arg Gly Val Trp Arg Gly Asp Gly He Met His Thr Arg 
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2020 2025 2030 

Cys His Cys Gly Ala Glu lie Thr Gly His Val Lys Asn Gly Thr Met 
2035 2040 2045 

Arg lie Val Gly Pro Arg Thr Cys Arg Asn Met Trp Ser Gly Thr Phe 
2050 2055 2060 

Pro lie Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Leu Pro Ala Pro 
2065 2070 2075 2080 

Asn Tyr Lys Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr Val Glu 
2085 2090 2095 

lie Arg Arg Val Gly Asp Phe His Tyr Val Ser Gly Met Thr Thr Asp 
2100 2105 2110 

Asn Leu Lys Cys Pro Cys Gin lie Pro Ser Pro Glu Phe Phe Thr Glu 
2115 2120 2125 

Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro Cys Lys Pro Leu 
2130 2135 2140 

Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val 
2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr 
2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His lie Thr Ala Glu Ala Ala Gly Arg 
2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Met Ala Ser Ser Ser Ala Ser 
2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr Ala Asn His Asp 
2210 2215 2220 

Ser Pro Asp Ala Glu Leu lie Glu Ala Asn Leu Leu Trp Arg Gin Glu 
2225 2230 2235 2240 

Met Gly Gly Asn lie Thr Arg Val Glu Ser Glu Asn Lys Val Val lie 
2245 2250 2255 

Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp Glu Arg Glu Val 
2260 2265 2270 

Ser Val Pro Ala Glu lie Leu Arg Lys Ser Arg Arg Phe Ala Arg Ala 
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2275 2280 2285 

Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Thr 
2290 2295 2300 

Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His Gly Cys Pro Leu 
2305 2310 2315 2320 

Pro Pro Pro Arg Ser Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr 
2325 2330 2335 

Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu Ala Glu Leu Ala 
2340 2345 2350 

Thr Lys Ser Phe Gly Ser Ser Ser Thr Ser Gly lie Thr Gly Asp Asn 
2355 2360 2365 

Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys Pro Pro Asp Ser 
2370 2375 2380 

Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser Ser Gly Ala 
2405 2410 2415 

Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly 
2420 2425 2430 

Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gin Lys Leu Pro lie Asn 
2435 2440 2445 

Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr 
2450 2455 2460 

Thr Ser Arg Ser Ala Cys Gin Arg Gin Lys Lys Val Thr Phe Asp Arg 
2465 2470 2475 2480 

Leu Gin Val Leu Asp Ser His Tyr Gin Asp Val Leu Lys Glu Val Lys 
2485 2490 2495 

Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala 
2500 2505 2510 

Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly 
2515 2520 2525 

Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val Ala His lie Asn 
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2530 2535 2540 

Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Val Thr Pro lie Asp Thr 
2545 2550 2555 2560 

Thr lie Met Ala Lys Asn Glu Val Phe Cys Val Gin Pro Glu Lys Gly 
2565 2570 2575 

Gly Arg Lys Pro Ala Arg Leu lie Val Phe Pro Asp Leu Gly Val Arg 
2580 2585 2590 

Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Ser Lys Leu Pro Leu 
2595 2600 2605 

Ala Val Met Gly Ser Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg 
2610 2615 2620 

Val Glu Phe Leu Val Gin Ala Trp Lys Ser Lys Lys Thr Pro Met Gly 
2625 2630 2635 2640 

Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp 
2645 2650 2655 

lie Arg Thr Glu Glu Ala lie Tyr Gin Cys Cys Asp Leu Asp Pro Gin 
2660 2665 2670 

Ala Arg Val Ala lie Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly 
2675 2680 2685 

Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg 
2690 2695 2700 

Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr 
2705 2710 2715 2720 

lie Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gin Asp Cys Thr 
2725 2730 2735 

Met Leu Val Cys Gly Asp Asp Leu Val Val lie Cys Glu Ser Ala Gly 
2740 2745 2750 

Val Gin Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr 
2755 2760 2765 

Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr Asp Leu 
2770 2775 2780 

Glu Leu lie Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly 
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2785 2790 2795 2800 

Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu 
2805 2810 2815 

Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp 
2820 2825 2830 

V 

Leu Gly Asn lie lie Met Phe Ala Pro Thr Leu Trp Ala Arg Met lie 
2835 2840 2845 

Leu Met Thr His Phe Phe Ser Val Leu lie Ala Arg Asp Gin Leu Glu 
2850 2855 2860 

Gin Ala Leu Asn Cys Glu lie Tyr Gly Ala Cys Tyr Ser lie Glu Pro 
2865 2870 2875 2880 

Leu Asp Leu Pro Pro lie lie Gin Arg Leu His Gly Leu Ser Ala Phe 
2885 2890 2895 

Ser Leu His Ser Tyr Ser Pro Gly Glu lie Asn Arg Val Ala Ala Cys 
2900 2905 2910 

Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala 
2915 2920 2925 

Arg Ser Val Arg Ala Arg Leu Leu Ser Arg Gly Gly Arg Ala Ala lie 
2930 2935 2940 

Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr Lys Leu Lys Leu 
2945 2950 2955 2960 

Thr Pro lie Ala Ala Ala Gly Arg Leu Asp Leu Ser Gly Trp Phe Thr 
2965 2970 2975 

Ala Gly Tyr Ser Gly Gly Asp lie Tyr His Ser Val Ser His Ala Arg 
2980 2985 2990 

Pro Arg Trp Phe Trp Phe Cys Leu Leu Leu Leu Ala Ala Gly Val Gly 
2995 3000 3005 

lie Tyr Leu Leu Pro Asn Arg 
3010 3015 



<210> 11 
<211> 24 
<212> DNA 
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<213> Hepatitis C virus 



<400> 11 



actggacacg gaggtggccg cgtc 



24 



<210> 12 
<211> 24 
<212> DNA 

<213> Hepatitis C virus 
<400> 12 

ttgttcttgt cgggttaatg gcgc 24 

<210> 13 
<211> 24 
<212> DNA 

<213> Hepatitis C virus 



<210> 14 
<211> 22 
<212> DNA 

<213> Hepatitis C virus 
<400> 14 

aagcgcccct aacttgatga tg 22 

<210> 15 
<211> 40 
<212> DNA 

<213> Hepatitis C virus 



<210> 16 
<211> 24 
<212> DNA 

<213> Hepatitis C virus 
<400> 16 



<400> 13 



gggtgtacta cacacatgag taag 



24 



<400> 15 



cgtcatcgat acctcagcgg gcatatgcac tggacacgga 



40 
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gtccagtgca tatgcccgct gagg 



24 



<210> 17 
<211> 32 
<212> DNA 

<213> Hepatitis C virus 
<400> 17 

catgcaccag ctgatatagc gcttgtaata tg 32 

<210> 18 
<211> 30 
<212> DNA 

<213> Hepatitis C virus 



<210> 19 
<211> 34 
<212> DNA 

<213> Hepatitis C virus 
<400> 19 

cagaggaggc agggctgcta tatgtggcaa gtac 34 

<210> 20 
<211> 34 
<212> DNA 

<213> Hepatitis C virus 



<210> 21 
<211> 43 
<212> DNA 

<213> Hepatitis C virus 
<400> 21 

cgtctctaga caggaaatgg cttaagaggc cggagtgttt acc 4 3 



<400> 18 



tccgtagagg aagcttgcag cctgacgccc 



30 



<400> 20 

gtacttgcca catatagcag ccctgcctcc tctg 



34 
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<210> 22 
<211> 65 
<212> DNA 

<213> Hepatitis C virus 
<400> 22 

ttatggatgc tcatcttgtt gggccaggcc gaagcagctt tggagaacct cgtaatactc 60 
aatgc 65 

<210> 23 
<211> 32 
<212> DNA 

<213> Hepatitis C virus 



<210> 24 
<211> 50 
<212> DNA 

<213> Hepatitis C virus 
<400> 24 

ttttttttgc ggccgctaat acgactcact atagacccgc ccctaatagg 50 

<210> 25 
<211> 31 
<212> DNA 

<213> Hepatitis C virus 



<210> 26 
<211> 26 
<212> DNA 

<213> Hepatitis C virus 
<400> 26 

ggatgtaccc catgaggtcg gcaaag 26 



<400> 23 



aggatttgtg ctcatggtgc acggtctacg ag 



32 



<400> 25 



ccgtgcacca tgagcacaaa tcctaaacct c 



31 



<210> 27 



<211> 30 
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<212> DNA 

<213> Hepatitis C virus 
<400> 27 

gtttgcgcct gcttatggat gctcatcttg 



<210> 28 
<211> 26 
<212> DNA 

<213> Hepatitis C virus 
<400> 28 

gcgtcataag catatgcctg ttgggg 



<210> 29 
<211> 23 
<212> DNA 

<213> Hepatitis C virus 
<400> 29 

ccctcagcac tggagtacat ctg 



<210> 30 
<211> 39 
<212> DNA 

<213> Hepatitis C virus - 
<400> 30 

cgtcatgcat acccctaggg cggctctcat tgaagaggg 



<210> 31 
<211> 30 
<212> DNA 

<213> Hepatitis C virus 
<400> 31 

cgtcccctct tcaatgagag ccgctctaga 



<210> 32 
<211> 28 
<212> DNA 

<213> Hepatitis C virus 



82 



WO 00/75338 PCT7US00/15446 

<400> 32 

gcggtgaaga ccaagctcaa actcactc 28 



<210> 33 
<211> 41 
<212> DNA 

<213> Hepatitis C virus 
<400> 33 

aatctagaag gcgcgcttcc ggcaatggag tgagtttgag c 41 



<210> 34 
<211> 38 
<212> DNA 

<213> Hepatitis C virus 
<400> 34 

cgtctctaga ggataaatcc aggaggcgcg cttccggc 3 8 



<210> 35 
<211> 27 
<212> DNA 

<213> Hepatitis C virus 
<400> 35 

tactttttgt aggggtaggc cttttcc 2 7 



<210> 36 
<211> 34 
<212> DNA 

<213> Hepatitis C virus 
<400> 36 

cgtctctaga gtgtagctaa tgtgtgccgc tcta 34 



<210> 37 
<211> 24 
<212> DNA 

<213> Hepatitis C virus 
<400> 37 

ctatggagtg tagctaatgt gtgc 24 
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<210> 38 
<211> 66 
<212> DNA 

<213> Hepatitis C virus 
<400> 38 

cgtctctaga catgatctgc agagagacca gttacggcac tctctgcagt catgcggctc 60 
acggac 66 

<210> 39 

<211> 41 

<212> DNA 

<213> Hepatitis C virus 



<400> 39 



ctttcacagc tagccgtgac tagggctaag atggagccac c 



41 
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